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PREFACE 


This book is the result of a sequence of two courses given in the School of Applied 
and Engineering Physics at Cornell University. The intent of these courses has been 
to cover a number of intermediate and advanced topics in applied mathematics that 
are needed by science and engineering majors. The courses were originally designed 
for junior level undergraduates enrolled in Applied Physics, but over the years they 
have attracted students from the other engineering departments, as well as physics, 
chemistry, astronomy and biophysics students. Course enrollment has also expanded 
to include freshman and sophomores with advanced placement and graduate students 
whose math background has needed some reinforcement. 

While teaching this course, we discovered a gap in the available textbooks we felt 
appropriate for Applied Physics undergraduates. There are many good introductory 
calculus books. One such example is Calculus and Analytic Geometry by Thomas and 
Finney, which we consider to be a prerequisite for our book. There are also many good 
textbooks covering advanced topics in mathematical physics such as Mathematical 
Methods for Physicists by Arfken. Unfortunately, these advanced books are generally 
aimed at graduate students and do not work well for junior level undergraduates. It 
appeared that there was no intermediate book which could help the typical student 
make the transition between these two levels. Our goal was to create a book to fill 
this need. 

The material we cover includes intermediate topics in linear algebra, tensors, 
curvilinear coordinate systems, complex variables, Fourier series, Fourier and Laplace 
transforms, differential equations, Dirac delta-functions, and solutions to Laplace’s 
equation. In addition, we introduce the more advanced topics of contravariance and 
covariance in nonorthogonal systems, multi-valued complex functions described with 
branch cuts and Riemann sheets, the method of steepest descent, and group theory. 
These topics are presented in a unique way, with a generous use of illustrations and 
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graphs and an informal writing style, so that students at the junior level can grasp and 
understand them. Throughout the text we attempt to strike a healthy balance between 
mathematical completeness and readability by keeping the number of formal proofs 
and theorems to a minimum. Applications for solving real, physical problems are 
stressed. There are many examples throughout the text and exercises for the students 
at the end of each chapter. 

Unlike many text books that cover these topics, we have used an organization that 
is fundamentally pedagogical. We consider the book to be primarily a teaching tool, 
although we have attempted to also make it acceptable as a reference. Consistent 
with this intent, the chapters are arranged as they have been taught in our two course 
sequence, rather than by topic. Consequently, you will find a chapter on tensors and 
a chapter on complex variables in the first half of the book and two more chapters, 
covering more advanced details of these same topics, in the second half. In our 
first semester course, we cover chapters one through nine, which we consider more 
important for the early part of the undergraduate curriculum. The last six chapters 
are taught in the second semester and cover the more advanced material. 

We would like to thank the many Cornell students who have taken the AEP 
321/322 course sequence for their assistance in finding errors in the text, examples, 
and exercises. E.A.W. would like to thank Ralph Westwig for his research help and 
the loan of many useful books. He is also indebted to his wife Karen and their son 
John for their infinite patience. 


Bruce R. KussE 
Erik A. WESTWIG 


Ithaca, New York 
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A REVIEW OF VECTOR AND 
MATRIX ALGEBRA USING 
SUBSCRIPT/SUMMATION 
CONVENTIONS 


This chapter presents a quick review of vector and matrix algebra. The intent is not 
to cover these topics completely, but rather use them to introduce subscript notation 
and the Einstein summation convention. These tools simplify the often complicated 
manipulations of linear algebra. 


1.1 NOTATION 


Standard, consistent notation is a very important habit to form in mathematics. Good 
notation not only facilitates calculations but, like dimensional analysis, helps to catch 
and correct errors. Thus, we begin by summarizing the notational conventions that 
will be used throughout this book, as listed in Table 1.1. 


TABLE 1.1. Notational Conventions 


Symbol Quantity 

a A real number 

B A complex number 

Aj A vector component 

M4j...; A matrix or tensor element 

[M] An entire matrix 

A A vector 

é A basis vector 

T A tensor 

£ An operator 
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A three-dimensional vector V can be expressed as 
V = V,é, + Vyé, + V8, (1.1) 


where the components (V,, V,,V,) are called the Cartesian components of V and 
(@,, éy, €,) are the basis vectors of the coordinate system. This notation can be made 
more efficient by using subscript notation, which replaces the letters (x, y, z) with the 
numbers (1, 2, 3). That is, we define: 


é = é, Vi = Vy 

@&=é and W=V, (1.2) 

é& = é Ve=ev, 
Equation 1.1 becomes 

Vv = Vie, + V2€, + V3é3, (1.3) 
or more succinctly, 

V= 3_ Ve. (1.4) 
i=1,2,3 


Figure 1.1 shows this notational modification on a typical Cartesian coordinate sys- 
tem. 

Although subscript notation can be used in many different types of coordinate 
systems, in this chapter we limit our discussion to Cartesian systems. Cartesian 
basis vectors are orthonormal and position independent. Orthonormal means the 
magnitude of each basis vector is unity, and they are all perpendicular to one another. 
Position independent means the basis vectors do not change their orientations as we 
move around in space. Non-Cartesian coordinate systems are covered in detail in 
Chapter 3. 

Equation 1.4 can be compacted even further by introducing the Einstein summation 
convention, which assumes a summation any time a subscript is repeated in the same 
term. Therefore, 


V=- > Vie; = Vi8;. (1.5) 


Figure 1.1 The Standard Cartesian System 
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We refer to this combination of the subscript notation and the summation convention 
as subscript/summation notation. 
Now imagine we want to write the simple vector relationship 


C=A+B. (1.6) 


This equation is written in what we call vector notation. Notice how it does not 
depend on a choice of coordinate system. In a particular coordinate system, we can 
write the relationship between these vectors in terms of their components: 


C, = Ay + B, 
Cz, = A2 + By (1.7) 
C3 = A3 + B3. 


With subscript notation, these three equations can be written in a single line, 
C; = Aj + B;, (1.8) 


where the subscript i stands for any of the three values (1, 2,3). As you will see 
in many examples at the end of this chapter, the use of the subscript/summation 
notation can drastically simplify the derivation of many physical and mathematical 
relationships. Results written in subscript/summation notation, however, are tied to 
a particular coordinate system, and are often difficult to interpret. For these reasons, 
we will convert our final results back into vector notation whenever possible. 

A matrix is a two-dimensional array of quantities that may or may not be associated 
with a particular coordinate system. Matrices can be expressed using several different 
types of notation. If we want to discuss a matrix in its entirety, without explicitly 
specifying all its elements, we write it in matrix notation as [M]. If we do need to 
list out the elements of [M], we can write them as a rectangular array inside a pair of 
brackets: 


My, My --: Mi 
Mx, My :+-- My 

[M]= | . : : ill (1.9) 
M, M,2 rieeg MM, 


We call this matrix array notation. The individual element in the second row and 
third column of [M] is written as M23. Notice how the row of a given element is 
always listed first, and the column second. Keep in mind, the array is not necessarily 
square. This means that for the matrix in Equation 1.9, r does not have to equal c. 
Multiplication between two matrices is only possible if the number of columns 
in the premultiplier equals the number of rows in the postmultiplier. The result of 
such a multiplication forms another matrix with the same number of rows as the 
premultiplier and the same number of columns as the postmultiplier. For example, 
the product between a 3 X 2 matrix [M] and a 2 X 3 matrix [N] forms the 3 X 3 matrix 
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[P], with the elements given by: 


My Mn Be N12 | 


My, Ma 
No No WN. 
M3, M39 21 N22_—«4N23 
~~ ee (N) 


{M] 


MiyNu + MyNo1 MyNi2 + MiNx2 MiyNi3 + M12N23 
= | MyNiy, + Ma2No, Mai Ni2 + M22N2 MaNi3 + MxNo3 |. (1.10) 
M3)Nu + M32No,  M31Ni2 + M32N22  M3,Ni3 + M32N23 
SS ee 
[P] 


The multiplication in Equation 1.10 can be written in the abbreviated matrix notation 
as 


[M][N] = [P]. (1.11) 
We can also use subscript/summation notation to write the same product as 
MijN jx = Pigs (1.12) 


with the implied sum over the j index keeping track of the summation. Notice j 
is in the second position of the M,; term and the first position of the Nj term, so 
the summation is over the columns of [M] and the rows of [N}], just as it was in 
Equation 1.10. Equation 1.12 is an expression for the ik” element of the matrix [P]. 

Matrix array notation is convenient for doing numerical calculations, especially 
when using a computer. When deriving the relationships between the various quan- 
tities in physics, however, matrix notation is often inadequate because it lacks a 
mechanism for keeping track of the geometry of the coordinate system. For example, 
in a particular coordinate system, the vector V might be written as 


V = 18 + 3 + 263. (1.13) 


When performing calculations, it is sometimes convenient to use a matrix represen- 
tation of this vector by writing: 


V— [VJ = {,3]. (1.14) 


The problem with this notation is that there is no convenient way to incorporate the 
basis vectors into the matrix. This is why we are careful to use an arrow (—) in 
Equation 1.14 instead of an equal sign (=). In this text, an equal sign between two 
quantities means that they are perfectly equivalent in every way. One quantity may 
be substituted for the other in any expression. For instance, Equation 1.13 implies 
that the quantity 1é, + 3@ + 2@; can replace V in any mathematical expression, and 
vice-versa. In contrast, the arrow in Equation 1.14 implies that [V] can represent V, 
and that calculations can be performed using it, but we must be careful not to directly 
substitute one for the other without specifying the basis vectors associated with the 
components of [V]. 
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1.2. VECTOR OPERATIONS 


In this section, we investigate several vector operations. We will use all the different 
forms of notation discussed in the previous section in order to illustrate their dif- 
ferences. Initially, we will concentrate on matrix and matrix array notation. As we 
progress, the subscript/summation notation will be used more frequently. 

As we discussed earlier, a three-dimensional vector V can be represented using a 
matrix. There are actually two ways to write this matrix. It can be either a (3 X 1) 
column matrix or a (1 X 3) row matrix, whose elements are the components of the 
vector in some Cartesian basis: 


V[V] = | V2 o §=V>[VJi=[V, Vo Ve]. (1.15) 


The standard notation [V]' has been used to indicate the transpose of [V], indicating 
an interchange of rows and columns. Remember the vector V can have an infinite 
number of different matrix array representations, each written with respect to a 
different coordinate basis. 


1.2.1 Vector Rotation 


Consider the simple rotation of a vector in a Cartesian coordinate system. This 
example will be worked out, without any real loss of generality, in two dimensions. 

We start with the vector A, which is oriented at an angle @ to the 1-axis, as shown 
in Figure 1.2. This vector can be written in terms of its Cartesian components as 


A= A\& + Are, (1.16) 
where 
A; = Acos@ A2 = Asin @. (1.17) 


In these expressions A = {A| = ,/A? + A} is the magnitude of the vector A. The 
vector A is generated by rotating the vector A counterclockwise by an angle @. This 


Geometry for Vector Rotation 
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changes the orientation of the vector, but not its magnitude. Therefore, we can write 


A’ = Acos(@ + )é; + Asin(@ + d)é. (1.18) 
wenn oom — ee” 
A! A 


1 2 


The components Aj and Aj can be rewritten using the trigonometric identities for the 
cosine and sine of summed angles as 


be = = : ; 
A; = Acos(@ + @) = Acos @cos @ — Asin Osin } 


al Az (1.19) 
—_ i = i + i - . 
Aj = Asin(6 + $) one Asin @cos 
1 A2 


If we represent A and A’ with column matrices, 
er te Ul 
A= A= | A’ [A= Be (1.20) 
Equations 1.19 can be put into matrix array form as 
Aj| _ |cos@ —sing| | A; 
| = free cos@ | | Az] ° 2) 


In the abbreviated matrix notation, we can write this as 


[A'] = [R(#)]IA].- (1.22) 

In this last expression, [R(@)] is called the rotation matrix, and is clearly defined as 
_ |cos@ —singd 

cron) = [ee ane]. (1.23) 


Notice that for Equation 1.22 to be the same as Equation 1.19, and for the matrix 
multiplication to make sense, the matrices [A] and [A’] must be column matrix arrays 
and [R(¢)] must premultiply [A]. The result of Equation 1.19 can also be written 
using the row representations for A and A’. In this case, the transposes of [R], [A] 
and [A’] must be used, and [R]' must postmultiply [A]*: 


[A’]* = [A] (Ry. (1.24) 


Written out using matrix arrays, this expression becomes 


F - cos@ sind 
[A; AS] =[Ai sat SE ok (1.25) 


It is easy to see Equation 1.25 is entirely equivalent to Equation 1.21. 
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These same manipulations can be accomplished using subscript/summation nota- 
tion. For example, Equation 1.19 can be expressed as 


The matrix multiplication in Equation 1.22 sums over the columns of the elements 
of [R]. This is accomplished in Equation 1.26 by the implied sum over j. Unlike 
matrix notation, in subscript/summation notation the order of A, and Rj; is no longer 
important, because 
R;;Aj; = AjRij- (1.27) 
The vector A’ can be written using the subscript notation by combining Equa- 
tion 1.26 with the basis vectors 


A , Rij Aj&.- (1.28) 


This expression demonstrates a “notational bookkeeping” property of the subscript 
notation. Summing over a subscript removes its dependence from an expression, much 
like integrating over a variable. For this reason, the process of subscript summation 
is often called contracting over an index. There are two sums on the right-hand side 
(RHS) of Equation 1.28, one over the i and another over j. After contraction over 
both subscripts, the are no subscripts remaining on the RHS. This is consistent with 
the fact that there are no subscripts on the left-hand side (LHS). The only notation 
on the LHS is the “overbar” on A’, indicating a vector, which also exists on the RHS 
in the form of a “hat” on the basis vector 6;. This sort of notational analysis can be 
applied to all equations. The notation on the LHS of an equals sign must always agree 
with the notation on the RHS. This fact can be used to check equations for accuracy. 
For example, 


A # R,jAj, (1.29) 


because a subscript i remains on the RHS after contracting over j, while there are no 
subscripts at all on the LHS. In addition, the notation indicates the LHS is a vector 
quantity, while the RHS is not. 


1.2.2 Vector Products 


We now consider the dot and cross products of two vectors using subscript/summation 
notation. These products occur frequently in physics calculations, at every level. The 
dot product is usually first encountered when calculating the work W done by a force 
F in the line integral 


WH ‘park. (1.30) 
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In this equation, dT is a differential displacement vector. The cross product can be 
used to find the force on a particle of charge gq moving with velocity Vin an externally 
applied magnetic field B: 


F = q(v X B). (1.31) 


The Dot Product The dot or inner product of two vectors, A and B, is a scalar 
defined by 


A - B = |A\[B\ cos @, (1.32) 


where @ is the angle between the two vectors, as shown in Figure 1.3. If we take the 
dot product of a vector with itself, we get the magnitude squared of that vector: 


A-A= |A|’. (1.33) 
In subscript/summation notation, Equation 1.32 is written as 
A: B= Ajé; - B,é;. (1.34) 


Notice we are using two different subscripts to form A and B. This is necessary to 
keep the summations independent in the manipulations that follow. The notational 
bookkeeping is working here, because there are no subscripts on the LHS, and none 
left on the RHS after contraction over both i and j. Only the basis vectors are involved 
in the dot product, so Equation 1.34 can be rewritten as 


A- B= A;B;(@; -é)). (1.35) 


Because we are restricting our attention to Cartesian systems where the basis vectors 
are orthonormal, we have 


7 1 eo 3 
66-15 wo: (1.36) 
The Kronecker delta, 
_fil i=j 
6; = a pepe (1.37) 
2 


Figure 1.3 The Dot Product 
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facilitates calculations that involve dot products. Using it, we can write é; - é; = 6,;, 
and Equation 1.35 becomes 


Equation 1.38 can be expanded by explicitly doing the independent sums over both i 
and j 


A M B = A,B 63, + A, B26)2 + A, B3643 + A2B, 81 a (1.39) 


Since the Kronecker delta is zero unless its subscripts are equal, Equation 1.39 reduces 
to only three terms, 


A-B = A,B, + ApBy + A3B3 = A;B;. (1.40) 


As one becomes more familiar with the subscript/summation notation and the 
Kronecker delta, these last steps here are done automatically with the RHS of the 
brain. Anytime a Kronecker delta exists in a term, with one of its subscripts repeated 
somewhere else in the same term, the Kronecker delta can be removed, and every 
instance of the repeated subscript changed to the other subscript of the Kronecker 
symbol. For example, 


Aj6;; oe Aj. (1.41) 


In Equation 1.38 the Kronecker delta can be grouped with the B; factor, and contracted 
over j to give 


A;(Bj8;;) = A;B;. (1.42) 


Just as well, we could group it with the A; factor, and sum over i to give an equivalent 
result: 


B,(A;6;;) = B;Aj. (1.43) 
This is true for more complicated expressions as well. For example, 
Mi j(Agdix) = MijAi 
or (1.44) 
BiT (Om 5 jm) = BiT jx€;. 


This flexibility is one of the things that makes calculations performed with sub- 
script/summation notation easier than working with matrix notation. 

We should point out that the Kronecker delta can also be viewed as a matrix or 
matrix array. In three dimensions, this representation becomes 


10 0 
8; -T1=]0 1 Of. (1.45) 
001 


This matrix can be used to write Equation 1.38 in matrix notation. Notice the con- 
traction over the index i sums over the rows of the [1] matrix, while the contraction 
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over j sums over the columns. Thus, Equation 1.38 in matrix notation is 


1 0 0] fB 
A-B-— [A]"[1][B] =[A; A. A3]|0 1 O| |B 
00 1] 1B; 

= [A]'[BI. (1.46) 


The Cross Product The cross or vector product between two vectors A and B forms 
a third vector C, which is written 


C=AXB. (1.47) 
The magnitude of the vector C is 
(C| = |[A[Bj sin @, (1.48) 


where 6 is the angle between the two vectors, as shown in Figure 1.4. The direction 
of C depends on the “handedness” of the coordinate system. By convention, the 
three-dimensional coordinate systems in physics are usually “right-handed.” Extend 
the fingers of your right hand straight, aligned along the basis vector é,. Now, curl 
your fingers toward the basis vector 6). If your thumb now points along é3, the 
coordinate system is right-handed. When the coordinate system is arranged this way, 
the direction of the cross product follows a similar rule. To determine the direction of 
C in Equation 1.47, point your fingers along A, and curl them to point along B. Your 
thumb is now pointing in the direction of C. This definition is usually called the right- 
hand rule. Notice, the direction of C is always perpendicular to the plane formed 
by A and B. If, for some reason, we are using a left-handed coordinate system, 
the definition for the cross product changes, and we must instead use a left-hand 
rule. Because the definition of a cross product changes slightly when we move to 


Figure 1.4 The Cross Product 
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systems of different handedness, the cross product is not exactly a vector, but rather a 
pseudovector. We will discuss this distinction in more detail at the end of Chapter 4. 
For now, we will limit our discussion to right-handed coordinate systems, and treat 
the cross product as an ordinary vector. 

Another way to express the cross product is by using an unconventional determi- 
nant of a matrix, some of whose elements are basis vectors: 


ois Fan & & & 
AXB=]| A; Ard A3 ; (1.49) 
B, Bo Bs 


det 
Expanding the determinant of Equation 1.49 gives 
A X B = (AB; — A3B2)é; 
+ (A3B, — A1B3)€2 (1.50) 
+ (A; Bz — A2B,)s. 
This last expression can also be written using subscript/summation notation, with 
the introduction of the Levi-Civita symbol & jx: 
A X B= A;Bjeéje, (1.51) 
where €;;, is defined as 
+) for (i, j,k) = even permutations of (1, 2, 3) 


Geo 4 for (i, j,k) = odd permutations of (1,2,3) . (1.52) 
0 if two or more of the subscripts are equal 


An odd permutation of (1,2, 3) is any rearrangement of the three numbers that can be 
accomplished with an odd number of pair interchanges. Thus, the odd permutations 
of (1,2,3) are (2, 1,3), (1,3,2), and (3,2, 1). Similarly, the even permutations of 
(1, 2, 3) are (1, 2, 3), (2, 3, 1), and (3, 1, 2). Because the subscripts 7, 7, and k can each 
independently take the values (1, 2,3), one way to visualize the Levi-Civita symbol 
is as the 3 X 3 X 3 array shown in Figure 1.5. 
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The cross product, written using subscript/summation notation in Equation 1.51, 
and the dot product, written in the form of Equation 1.38 are very useful for manual 
calculations, as you will see in the following examples. 


1.2.3. Calculations Using Subscript/Summation Notation 


We now give two examples to demonstrate the use of subscript/summation notation. 
The first example shows that a vector’s magnitude is unaffected by rotations. The 
primary function of this example is to show how a derivation performed entirely 
with matrix notation can also be done using subscript notation. The second derives a 
common vector identity. This example shows how the subscript notation is a powerful 
tool for deriving complicated vector relationships. 


Example 1.1 Refer back to the rotation picture of Figure 1.2, and consider the prod- 

ucts A-A and A’ - A’, first using matrix notation and then using subscript/summation 

notation. Because A’ is generated by a simple rotation of A we know these two dot 

products, which represent the magnitude squared of the vectors, should be equal. 
Using matrices: 


A-A-— [A]'{A] (1.53) 
A’- A’ [A'[A’). (1.54) 

But [A’] and [A’]' can be expressed in terms of [A] and [A]* as 
[A'] = [RPA] LAT" = TAT ER@)I, (1.55) 


where R(@) is the rotation matrix defined in Equation 1.23. If these two equations 
are substituted into Equation 1.54, we have 


A’ - A’ = [A]*[R(P)I' LRP) ILA). (1.56) 


The product between the two rotation matrices can be performed, 


Ran tRcoy = | SP olla eels iP (1.57) 


—sing cosd sind cosd 
and Equation 1.56 becomes 
A - A’ [A]'[1}[A] = [A}‘[4] > A- A, (1.58) 
Our final conclusion is that 
A’-A'=A-A. (1.59) 


To arrive at this result using matrices, care had to be taken to be sure that the matrix 
operations were in the proper order and that the row, column, and transpose matrices 
were all used correctly. 


VECTOR OPERATIONS 13 


Now let’s repeat this derivation using the subscript/summation notation. Equa- 
tion 1.40 allows us to write 


A-A= AjAj (1.60) 


A’-A’ = AlAl, (1.61) 


Notice how we have been careful to use different subscripts for the two sums in 
Equations 1.60 and 1.61. This ensures the sums wil] remain independent as they 
are manipulated in the following steps. The primed components can be expressed in 
terms of the unprimed components as 


Aj =, RijAj, (1.62) 


where R;; is the ij" component of the rotation matrix R[]. Inserting this expression 
into Equation 1.61 gives 


A’-A’ = R,A,RryAy, (1.63) 


where again, we have been careful to use the two different subscripts u and v. This 
equation has three implicit sums, over the subscripts r, u, and v. 

In subscript notation, unlike matrix notation, the ordering of the terms is not 
important, so we rearrange Equation 1.63 to read 


A’ - A’ = A,AyRnRrv- (1.64) 


Next concentrate on the sum over r, which only involves the [RJ] matrix elements, 
in the product R,,R,,. What exactly does this product mean? Let’s compare it to an 
operation we discussed earlier. In Equation 1.12, we pointed out the subscripted ex- 
pression M; ;N , represented the regular matrix product [M][{N], because the summed 
subscript j is in the second position of the {M] matrix and the first position of the 
{N] matrix. The expression R,,R,,, however, has a contraction over the first index of 
both matrices. In order to make sense of this product, we write the first instance of 
[R] using the transpose: 


RrRry > [R)'[RI- (1.65) 
Consequently, from Equation 1.57, 
RuRry = Suv- (1.66) 
Substituting this result into Equation 1.64 gives 
A’- A’ = A,A,8y = AvAy = A- A, (1.67) 


Admittedly, this example is too easy. It does not demonstrate any significant 
advantage of using the subscript/summation notation over matrices. It does, how- 
ever, highlight the equivalence of the two approaches. In our next example, the 
subscript/summation notation will prove to be almost indispensable. 
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Example 1.2. The subscript/summation notation allows the derivation of vector 
identities that seem almost impossible using any other approach. The example worked 
out here is the derivation of an identity for the double cross product between three 
vectors, A X (B X C). This one example essentially demonstrates all the common 
operations that occur in these types of manipulations. Other examples are suggested 
in the problems listed at the end of this chapter. 

The expression A X (B X ©) is written in vector notation and is valid in any 
coordinate system. To derive our identity, we will convert this expression into sub- 
script/summation notation in a Cartesian coordinate system. In the end, however, we 
will return our answer to vector notation to obtain a result that does not depend upon 
any coordinate system. In this example, we will need to use the subscripted form for 
a vector 


V = Vid, (1.68) 


for a dot product between two vectors 


A: B= AjB;, (1.69) 
and for a cross product 
A X B= A;Bje€;jx. (1.70) 
To begin, let 
D=BxC, (1.71) 


which, written using the Levi-Civita symbol, is 
D = B,C je €jx- (1.72) 


Substituting Equation 1.71 into the expression A X (B X ©), and using the Levi-Civita 
expression again, gives 


AX (BX © =AXD = A,D,ee 51. (1.73) 


The s* component of D is obtained by dot multiplying both sides of Equation 1.72 
by 6, as follows: 
D, = 6D = 6» BC jee 
= BiCjeije(Es > &) 
1.74 
= BCE: 5x a) 
= B;C jGijs- 


Substituting the result of Equation 1.74 into Equation 1.73 gives 


A x (B x C) = A, BjC j€js€:€rst, (1.75) 
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which we rearrange slightly to read 
A X (BX C) = A,B,C j€€51€;js- (1.76) 
To proceed, some properties of the Levi-Civita symbol need to be developed. First, 
because of the definition of the Levi-Civita symbol given in Equations 1.52, it is clear 
that reversing any two of its subscripts just changes its sign, i.e., 
Eijk = — €ikj = Ejki- (1.77) 
The second property involves the product of two Levi-Civita symbols that have a 
common last subscript: 
€ijkEmnk = Sim9 jn — SinDjm- (1.78) 


With a considerable amount of effort, it can be shown that the RHS of Equation 1.78 
has all the properties described by the product of the two Levi-Civita symbols on 
the LHS, each governed by Equations 1.52. A proof of this identity is given in 
Appendix A. 

With Equations 1.77 and 1.78 we can return to Equation 1.76, which can now be 
rewritten 


A x (B x ©) = A,B,C ;€,(8, 54 — 67:5,;)- (1.79) 
After removing the Kronecker deltas, we obtain 
A x (B x ©) = A Bi C@; 4 AjB;C é;. (1.80) 


At this point, you can really see the usefulness of the subscript/summation notation. 

The factors in the two terms on the RHS of Equation 1.80 can now be rearranged, 

grouped according to the summations, and returned to vector notation in just two 
lines! The procedure is: 

A X (Bx © = (A;C;)(B&) — (A:Bi(C;é;) (1.81) 

= (A: ©)B-— (A- BC. (1.82) 


Equation 1.81 is valid only in Cartesian systems. But because Equation 1.82 is in 
vector notation, it is valid in any coordinate system. 


In the exercises that follow, you will derive several other vector identities. These 
will illustrate the power of the subscript/summation notation and help you become 
more familiar with its use. 


EXERCISES FOR CHAPTER 1 


1. Consider the two matrices: 


m= |; ‘| m= |i ale 
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With matrix notation, a product between these two matrices can be expressed as 
[M][N]. Using subscript/summation notation, this same product is expressed as 
M, i jN jk 
(a) With the elements of the [M] and [N] matrices given above, evaluate the 
matrix products [M][N], [N][M] and [M][M], leaving the results in matrix 
array notation. 
(b) Express the matrix products of part (a) using the subscript/summation nota- 
tion. 
(c) Convert the following expressions, which are written in subscript/summation 
notation, to matrix notation: 
i. M ikNi je 
ii. M, i NE je 
iti. M jiN jx. 
iv. MijNix + Ti. 
Vv. M jiNxj + Tx. 


. Consider the square 3 < 3 matrix [M] whose elements M;; are generated by the 


expression 
Mi; = ij? force, j = 1,2,3, 
and a vector V whose components in some basis are given by 
Vi =k fork = 1,2,3. 


(a) Using a matrix representation for V — [V], determine the components of 
the vector that result from a premultiplication of [V] by [M]. 
(b) Determine the components of the vector that result from a premultiplication 


of [M] by [VJ]. 


» The matrix 


[R] = cos 6 | 


—sin@ cos@ 


represents a rotation. Show that the matrices [RE = [R][Rl and [RP = [RI[RI[RI 
are matrices that correspond to rotations of 26 and 36 respectively. 


. Let [D] be a 2 X 2 square matrix and [V] a 2 < 1 row matrix. Determine the 


conditions imposed on [D] by the requirement that 
[DILV] = (V]"(D] 
for any [V]. 


. The trace of a matrix is the sum of all its diagonal elements. Using the sub- 


script/summation notation to represent the elements of the matrices [J] and 
[M4], 
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(a) Find an expression for the trace of [T]. 


(b) Show that the trace of the matrix formed by the product [T][M] is equal to 
the trace of the matrix formed by [M][T]. 


. Let [M] be a square matrix. Express the elements of the following matrix products 


using subscript/summation notation: 
(a) [MI[M}". 
(b) [M]'[M]. 
(c) [M]*[M]'. 


. Convert the following Cartesian, subscript/summation expressions into vector 


notation: 

(a) V;A;Bjé;. 

(b) cA;Bj8;;. 

(c) A,B je, € jx 8). 

(d) A;BjCiDin €; jx €imk- 


. Expand each of the following expressions by explicitly writing out all the terms 


10. 


in the implied summations. Assume (i, j, k) can each take on the values (1, 2, 3). 
(a) 6; jCijk: 

(b) T;;A;. 

(c) 8,;TjjAi. 

Express the value of the determinant of the matrix 


ay; 412 a3 
42; 422 423 
43; 432 433 


using subscript/summation notation and the Levi-Civita symbol. 


Prove the following vector identities using subscript/summation notation: 
(a) A: (BX C) = (AX B)-C. 

(b) AX B= —BXA. 

(c) (A X B)- (CX D) = (A- CB: D) — (A DB: ©). 

(d) (A X B) X (C X D) = [(A X B) - DIC — [(A x B) - CUD. 


- 


DIFFERENTIAL AND INTEGRAL 
OPERATIONS ON VECTOR AND 
SCALAR FIELDS 


A field quantity is a function that depends upon space and possibly time. Electric 
potential, charge density, temperature, and pressure each have only a magnitude, and 
are described by scalar fields. In contrast, electric and magnetic fields, gravity, current 
density, and fluid velocity have both magnitude and direction, and are described by 
vector fields. 

Integral and differential operations on vector and scalar field quantities can be 
expressed concisely using subscript notation and the operator formalism which is 
introduced in this chapter. 


2.1 PLOTTING SCALAR AND VECTOR FIELDS 


Because it is frequently useful to picture vector and scalar fields, we begin with a 
discussion of plotting techniques. 


2.1.1 Plotting Scalar Fields 


Plots of scalar fields are much easier to construct than plots of vector fields, because 
scalar fields are characterized by only a single value at each location in time and space. 
Consider the specific example of the electric potential ® produced by two parallel, 
uniform line charges of +A, coulombs per meter that are located at (x = +1, y = 0). 
In meter-kilogram-second (MKS) units, the electric potential produced by this charge 
distribution is 


2 2 
ro | \e3 ay | (2.1) 


4me, |(x—-1P? + y? 
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Eguipotentials ____~~ Electric field lines 


Figure 2.1 Equipotentials and Electric Field Lines of Two Parallel Line Charges 


Usually we want to construct surfaces of constant ®, which in this case are cylinders 
nested around the line charges. Because there is symmetry in the z direction, these 
surfaces can be plotted in two dimensions as the dashed, circular contours shown in 
Figure 2.1. The centers of these circles are located along the x-axis from 1 < x << 
for positive values of ®, and from —o < x < —1 for negative values of . ® = 0 
lies on the y-axis, which you can think of as a circle of infinite radius with its center 
at infinity. If the contours have evenly spaced constant values of ®, the regions of 
highest line density show where the function is most rapidly varying with position. 


2.1.2 Plotting Vector Fields 


Because vectors have both magnitude and direction, plots for their fields are usually 
much more complicated than for scalar fields. For example, the Cartesian components 
of the electric field of the preceding example can be calculated to be 


_ a® _ No x? rt y? —]1 

E, Ox ‘TE, lz — 1)? + y?][( + 1)? + a (2.2) 
=~ a® oo ro 2xy 

° dy TE, Fe — 1)? + y2I[@ + 1)2 + al ' (2.3) 


A vector field is typically drawn by constructing lines which are tangent to the 
field vector at every point in space. By convention, the density of these field lines 
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indicates the magnitude of the field, while arrows show its direction. If we suppose 
an electric field line for Equations 2.2 and 2.3 is given by the equation y = y(x), then 


== = =. 2.4 
dx E, x%-y?-1 ay 

With some work, Equation 2.4 can be integrated to give 
P+(y-ch =1+4+e’, (2.5) 


where c is the constant of integration. This constant can be varied between —© and 
© to generate the entire family of field lines. For this case, these lines are circles 
centered on the y-axis at y = c with radii given by \/1 + c?. They are shown as the 
solid lines in Figure 2.1. The arrows indicate how the field points from the positive 
to the negative charge. Notice the lines are most densely packed directly between the 
charges where the electric field is strongest. 


2.2. INTEGRAL OPERATORS 


2.2.1 Integral Operator Notation 


The gradient, divergence, and curl operations, which we will review later in this 
chapter, are naturally in operator form. That is, they can be represented by a symbol 
that operates on another quantity. For example, the gradient of ® is written as V®. 
Here the operator is V, which acts on the operand © to give us the gradient. 

In contrast, integral operations are commonly not written in operator form. The 
integral of f(x) over x is often expressed as 


/ F(x) dx, (2.6) 


which is not in operator form because the integral and the operand f(x) are inter- 
mingled. We can, however, put Equation 2.6 in operator form by reorganizing the 
terms in this equation: 


/ dx f(x). (2.7) 


Now the operator f dx acts on f(x) to form the integral, just as the Y¥ operator acts 
on ® to form the gradient. In practice, the integral operator is moved to the right, 
passing through all the terms of the integrand that do not depend on the integration 
variable. For example, 


i x(x + y)y? = 2 pu (x + y). (2.8) 
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2.2.2 Line Integrals 


The process of taking an integral along a path is called line integration and is a 
common operation in all branches of physics. For example, the work that a force F 
performs as it moves along path C is 


W= [ark (2.9) 
Cc 


Here the line integral operator i df acts on the force F. The differential displacement 


c 
vector d¥ is tangent to each point along C, as shown in Figure 2.2. If C closes on 
itself, we write the operator with a circle on the integral sign: 


$ ar. (2.10) 
c 


Because Equation 2.9 is written in vector notation, it is valid in any coordinate 
system. In a Cartesian system, df = dx;é; and Equation 2.9 becomes 


w= | ar-F- [ant (2.11) 
Cc Cc 


Notice that the quantity produced by this integration is a scalar, and the subscript i is 
summed implicitly. 

There are other less common line integral operations. For example, the operator 
in 


[are = a | ax, ® (2.12) 
c Cc 
acts on the scalar ® to produce a vector. Another example, 
fi dr X V = &; | ax; Vi Eijk, (2.13) 
c c 


generates a vector using the cross product. Note that all these subscripted expressions 
are written in a Cartesian system where the basis vectors are orthonormal and position 
independent. 


Figure 2.2 The Line Integral 
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2.2.3 Surface Integrals 
Surface integrals involve the integral operator, 


[. (2.14) 
Ss 


where d@ is a vector that represents a differential area. This vector has a magnitude 
equal to a differential area of S, and a direction perpendicular to the surface. If we 
write the differential area as do and the unit normal fi, the differential area vector 
becomes d@ = fi do. Because a surface has two sides, there is ambiguity in how 
to define fi. For a simple, closed surface such as the one in Figure 2.3(a), we define 
fi to always point in the “outward” direction. If the surface is not closed, i.e., does 
not enclose a volume, the direction of fi is typically determined by the closed path 
C that defines the boundaries of the surface, and the right-hand rule, as shown in 
Figure 2.3(b). 

Frequently, the surface integral operator acts on a vector field quantity by means 
of the dot product 


i, do-V. (2.15) 
Ss 
In Cartesian coordinates, 

dé = da;é;, (2.16) 


where do; is positive or negative depending on the sign of fi - é;, as discussed in the 
previous paragraph. This surface integral becomes 


[a ‘V= fe V;. (2.17) 
AY AY 


There are less common surface integrals of the form 


s s 


Figure 2.3 Surface Integrals 
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which is an operation on a scalar which produces a vector quantity, and 
[ee x Vv _ a | ao, V Eijk, (2.19) 
Ss Ss 


which also generates a vector. 


2.2.4 Volume Integrals 


The volume integral is the simplest integral operator because the variables of inte- 
gration are scalars. It is written 


i dt, (2.20) 
Vv 


where d7 is a differential volume, and V represents the total volume of integration. 
The most common volume integral acts on a scalar field quantity and, as a result, 
produces a scalar 


' dz ®, (2.21) 
v 
In Cartesian coordinates, this is written 
1 dx, dx, dxz ®. (2.22) 
Vv 
Volume integrals of vector quantities are also possible: 


i dtV= | dx, dx, dx, V. (2.23) 
V Vv 


2.3 DIFFERENTIAL OPERATIONS 


By their definition, field quantities are functions of position. Analogous to how 
the change in a function of a single variable is described by its derivative, the 
position dependence of a scalar field can be described by its gradient, and the position 
dependence of a vector field by its curl and divergence. The Del operator V is used 
to describe all three of these fundamental operations. 
The operator V is written in coordinate-independent vector notation. It can be 
expressed in subscript/summation notation in a Cartesian coordinate system as 
= .29 
Vv =¢,—. (2.24) 
OX; 
Keep in mind, this expression is only valid in Cartesian systems. It will need to be 
modified when we discuss non-Cartesian coordinate systems in Chapter 3. 
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When operating on a scalar field, the V operator produces a vector quantity called 
the gradient: 


ID(x}, x2, %3) 


ax, (2.25) 


V@(x1, 22,43) = & 
For example, in electromagnetism the electric field is equal to the negative gradient 
of the electric potential: 


E = -Vo = —¢, 22@0 22%) 
OX; 


(2.26) 

The Del operator also acts upon vector fields via either the dot product or the cross 
product. The divergence of a vector field is a scalar quantity created using the dot 
product: 


=< 0 OAj 
-A=6—-A sé; = —. 27 
A ba AR = 3 (2.27) 


The charge density p, in a region of space can be calculated using the divergence 
from the relation 


Pe = €(V - E). (2.28) 
If instead we used the cross product, we produce a vector quantity called the curl, 
= wTxz_f.? . {9a r 
VxXA= meee Xx Aje; = Bae Cx Eijk (2.29) 


where we have used the Levi-Civita symbol to express the cross product in sub- 
script/summation notation. One of Maxwell’s equations relates the electric field to 
the rate of change of the magnetic field using the curl operator: 


VxE--%. (2.30) 
at 


2.3.1 Physical Picture of the Gradient 
The gradient of a scalar field is a vector that describes, at each point, how the field 
changes with position. Dot multiplying both sides of Equation 2.25 by dr = dx,é; 
gives 
= a® 
dr-V® = dx;e; -éj;—. (2.31) 
Ox j 
Manipulating the RHS of this expression gives 


d¥-V® = 57 ax. (2.32) 
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The RHS of this expression can be recognized as the total differential change of ® 
due to a differential change of position d¥. The result can be written entirely in vector 
notation as 


d® = VO -dr. (2.33) 


From Equation 2.33, it is clear that the largest value of d® occurs when dF points in 
the same direction as V®. On the other hand, a displacement perpendicular to V® 
produces no change in ® because d® = 0. This means that the gradient will always 
point perpendicular to the surfaces of constant ®. 

At the beginning of this chapter, we discussed the electric potential function 
generated by two line charges. The electric field was generated by taking the gradient 
of this scalar potential, and was plotted in Figure 2.1. While we could use this example 
as a model for developing a physical picture for the gradient operation, it is relatively 
complicated. Instead, we will look at the simpler two-dimensional function 


® = —xy. (2.34) 


A plot of the equipotential lines of ® in the xy-plane is shown in Figure 2.4. The 
gradient operating on this function generates the vector field 


V® = —yé, — xéy. (2.35) 


Now imagine we are at the point (1,2) and move to the right, by an infinitesi- 
mal amount dr along the positive x-axis. The corresponding change in ® can be 
determined by calculating: 


d® =V®-d¥ 
= (—2@, — 1@,) - (dr 6,) (2.36) 


Figure 2.4 Surfaces of Constant ® = —xy 
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This says ® decreases by 2 units for every unit moved in that direction. If instead, 
we were sitting at the point (3, 4) and move an infinitesimal amount dr, at a 45-degree 
angle with respect to the x-axis, ® changes by an amount 


d® =V® -d¥ 
= (-4é sae gute + @) (2.37) 
x 'y 2 x y : 
Pee 


Notice, these changes are for infinitesimal quantities. To calculate the change of ® 
over a finite path, where the gradient changes as we move from point to point, we 
need to use a line integral 


A® = [ dt-V®. (2.38) 
Cc 


When using the gradient to generate a vector field, we usually add a negative sign 
to the definition. For example, the electric field is generated from the electrostatic 
potential by 


E = -V®9. (2.39) 


Using this convention, moving against the field lines increases ®. For the potential 
of Equation 2.34, the negative gradient is 


—V® = yé, + x8. (2.40) 


The field lines for this function can be determined as follows: 


ae 
dx y 
dyy=dxx 
y=x*+e 
Ne yr Se (2.41) 


These lines are perpendicular to the lines of constant ®, as shown in Figure 2.5. 
Notice how the density of the vector field lines shows that the magnitude of the field 
increases as you move away from the origin. 

In summary, the gradient of a scalar function ® generates a vector field which, at 
each point, indicates the direction of the greatest increase of P and lies perpendicular 
to the lines or surfaces of constant ®. The example discussed above was worked 
out in two dimensions, but the process can also be visualized in three dimensions, 
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TanmN 


Figure 2.5 Field Lines for ® = —xy 


where ® = constant generates surfaces, and the gradient is always normal to these 
surfaces. Except for visualization, there are no limits to the number of dimensions 
for the gradient operation. 


2.3.2 Physical Picture of the Divergence 


The divergence operation will be described physically by developing the continuity 
equation, which describes the change in the local particle density of a fluid as a func- 
tion of time. Let p(x, y, z, f) be the number of particles per unit volume and V(x, y, z, t) 
be the velocity of these particles, at the point (x, y, z) and the time t. Consider a dif- 
ferential volume dt = dx dy dz located at (%, yo, Zo), as shown in Figure 2.6. The 
continuity equation is obtained by making the assumption that particles can be neither 
created nor destroyed in this volume, and then equating any net flow of particles into 
or out of the volume with a corresponding change in p. If we call the total number of 
particles in the infinitesimal volume N = pdr, then 


ON _ 90(Xo, Vos Zost) 
ot ot 


dx dy dz. (2.42) 


The rate of change of N in Equation 2.42 needs to be accounted for by the flow 
of particles across the six surfaces of dr. First, consider the bottom cross-hatched 
surface of Figure 2.6. The flow across this surface can be determined with the aid 
of Figure 2.7(a). The total number of particles that enter the volume in a time dt 
across this surface is equal to the number of particles in the shaded region dx dy v,dt. 
Notice, a positive v, adds particles to the volume, while a negative v, removes them. 
Thus, the contribution of the bottom surface to dN/dt is 


ON, bottom 


as P(%o2 Vos Zos F)Uz( Xo, Vos Zo, t) Ax dy. (2.43) 
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/« 
ix 


Figure 2.6 Differential Volume 


Notice, both p and v, are evaluated at (%0, yo, Zo) in this expression. By defining the 
vector J = pv, which is called the current density, Equation 2.43 can be written more 
concisely as 


ON, bottom 
ot 


This same type of calculation can be made for the top cross-hatched surface 
shown in Figure 2.6. Figure 2.7(b) shows that now a positive vu, carries the number 
of particles in the shaded region out of the volume. This side contributes 


ON top 
ot 


to the total dN/dt. Notice, in this case, we evaluate J, at the point (%9, Yo, Zo + dz). 
Combining Equations 2.44 and 2.45 gives 


= J,(X9, Yor Zo, t) dx dy. (2.44) 


= —J(%o, Yo: Zo + dz, t) dx dy (2.45) 


ON, bottom oe ON, top 


a rr [Je(%os You2Z0.t) ~ Je(Xos Yor Zo + dz,t)| dxdy. (2.46) 


(a) (b) 
Figure 2.7 Flow Across the Bottom and Top Surfaces 
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This last expression can be written in terms of the derivative of J,, because in the 
differential limit we have 


ad 
IiXo, Yor%o + dz,t) = Ig(Xos Yoo %ost) + =* dz. (2.47) 
a (X0,Yo.20) 
Substitution of Equation 2.47 into Equation 2.46 gives 
ONbotom , ONto, 
ANeotiom 5 INiop _ _ Az dx dy dz. (2.48) 
ot dt oz (XoYo:2o) 


Working on the other four surfaces produces similar results. The total flow into 
the differential volume is therefore 


_ ody 
(%0,¥o Zo) dy 


_ ad, 
aH 


ON _ _ Oss 
Ox 


dx dy dz, (2.49) 


(Xo,Yo:Zo) (%0,Yor20) 


which can be recognized as —V - J times dr. Combining this result with Equation 
2.42 gives us the continuity equation: 


—=-V-j. (2.50) 


For a positive divergence of J, more particles are leaving than entering a region so 
dp/ dt is negative. 

This exercise provides us with a physical interpretation of the divergence. If the 
divergence of a vector field is positive in a region, this region is a source. Field 
lines are “born” in source regions. On the other hand, if the divergence in a region is 
negative, the region is a sink. Field lines “terminate” in sink regions. If the divergence 
of a vector field is zero in a region, then all field lines that enter it must also leave. 


2.3.3 Physical Picture of the Curl 


The curl of a vector field is a vector, which describes on a local scale the field’s 
circulation. From the word curl, itself, it seems reasonable to conclude that if a vector 
field has a nonzero curl the field lines should be “curved,” while fields with zero curl 
should be “straight.” This is a common misconception. It is possible for vector field 
lines to appear as shown in Figure 2.8(a), clearly describing a “curved” situation, 
and have no curl. Also, the fields lines shown in Figure 2.8(b), which are definitely 
“straight,” can have a nonzero curl. To resolve this confusion, we must look at the 
curl on a differential scale. 

Consider a vector field V that is a function of only x and y. The curl of this field 
points in the z-direction, and according to Equation 2.29 is given by 


VxV= (2 es oe, (2.51) 


“Ox. oy 


for a Cartesian coordinate system. 
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(a) (b) 
Figure 2.8 Circulating and Noncirculating Vector Fields 


Now consider the line integral of a vector field V around the closed path shown in 
Figure 2.9, 


¢ d¥-V. (2.52) 
C 


The starting point for the integration is the point (x, yo). In this derivation, we need to 
be a little more careful with the treatment of infinitesimal quantities than we were in 
the discussion of the divergence. For this reason, we begin by letting the dimensions 
of the loop be Ax and Ay, as shown in the figure. Later in the derivation, we will 
shrink these to infinitesimal quantities to obtain our final result. , 

The integral along C can be broken up into four parts. First consider the integration 
along C,, where y = y, and x varies from x, to x» + Ax: 


= Xo t Ax 
adr-V= | dx V,.. (2.53) 


CQ Xo 


Along this segment, we can expand V,(x, y,) with a Taylor series, keeping up to the 
linear term in x: 

ov, 
Ox 


(x ~ Xo). (2.54) 


(o,¥o) 


V(X, Yo) = Wx(%o. Yo) + 


Ay 


(Xo: Yo) 


Figure 2.9 Closed Path for Curl Integration 
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Keeping higher-order terms in this expansion will not change the results of this deriva- 
tion. Substituting Equation 2.54 into Equation 2.53 and performing the integration 
gives 


(Ax). (2.55) 


(%0,Yo) 


= 1 OV, 
d¥-V = V,(Xo, Yo) Ax + = — 
i: (on Yo)Ax 2 ax 


Next the integration along C3, the top section of the loop, is performed. Along this 
path, y is held fixed at y = y, + Ay, while x varies from x, + Ax to x,. Therefore, 


pave aeVis (2.56) 
C3 Xot Ax 


0 


Again, we expand V,(x, y, + Ay) to first order with a Taylor series: 


ov», 


Vy 


Ox 


Vi(X, Yo + Ay) = V,(Xo; Yo) + 


(x — X) + 


Ay. (2.57) 
(%0,Yo) dy 


(X0,Yo) 


Substituting Equation 2.57 into Equation 2.56 and performing the integration gives 


7 av. Vi 
‘) d¥-V = —V,(Xo, Yo) AX — sat (Axy - evs Ax Ay. (2.58) 
C3 ox (%0,Yo) (X0,Yo) 
Combining Equations 2.55 and 2.58 gives 
dr-V+ deve aoe Ax Ay. (2.59) 
Cc C3 oy (Xo sYo) 


After performing similar integrations along the C, and C, paths, we can combine all 
the results to obtain 


~ ov 

$ d¥-V~= (% 

Cc Ox 

The error of Equation 2.60 vanishes as the dimensions of the loop shrink to the 
infinitesimal, Ax — 0 and Ay — 0. Also, using Equation 2.51, the term in the 


brackets on the RHS of Equation 2.60 can be identified as the z component of vVxV. 
Therefore, we can write 


ov; 


oy 


) Ax Ay. (2.60) 
(%0,Yo) 


(Xoo) 


im, nee -V=@-(V xV) tim [ ae, (2.61) 
where C is the contour that encloses S and do, = dx dy is a differential area of that 
surface. 

What does all this tell us about the curl? The result in Equation 2.61 can be 
reorganized as: 


(2.62) 
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(a) (b) 
Figure 2.10 Fields with Zero (a) and Nonzero (b) Curl 


This says the z component of V X V at a point is the line integral of V on a loop 
around that point, divided by the area of that loop, in the limit as the loop becomes 
vanishingly small. Thus, the curl does not directly tell us anything about the circulation 
on a macroscopic scale. 

The situations in Figure 2.8 can now be understood. If the “curved” field shown in 
Figure 2.10(a) has a magnitude that drops off as 1/r, exactly enough to compensate 
for the increase in path length as r increases, then the integral around the closed 
differential path shown in the figure will be zero. Thus, the curl at that point is 
also zero. If the magnitude of the “straight” vector field in Figure 2.10(b) varies as 
indicated by the line density, the integral around the differential path shown cannot 
be zero and the field will have a nonzero curl. 

We derived Equation 2.61 in two dimensions and only picked out the z-component 
of the curl. The generalization of this result to three dimensions, and for any orienta- 
tion of the differential loop, is straightforward and is given by 

lim $ dr-V = (V Xx Vv) : lim, f az. (2.63) 
cC-0 Jc 5-0 Js 
In this equation, S is still the area enclosed by the path C. Note that the direction of 
d@ is determined by the direction of C and a right-hand convention. 


2.3.4 Differential Operator Identities 


Subscript/summation notation greatly facilitates the derivation of differential operator 
identities. The relations presented in this section are similar to the vector identities 
discussed in Chapter |, except now care must be taken to obey the rules of differential 
calculus. As with the vector identities, a Cartesian coordinate system is used for 
the derivations, but the final results are expressed in coordinate-independent vector 
notation. 


Example 2.1 Consider the operator expression, V - (V®). To translate this into 
subscript/summation notation, make the substitution 
— to] 
V =@¢—. (2.64) 
Ox; 
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The two V operators in the original expression must be written using independent 
subscripts: 


= <= , oO . oO 
V+ (Ve) = 85 (=). (2.65) 


Because the basis vectors of a Cartesian system are position independent, 0€ ;/dx; = 
0, and Equation 2.65 becomes 


¥-(¥o) = @- a5 (Le) 
8 (ag") 
ala”) 
Gt ae tam) 2.6 


In the last line, we have written out the implied summation explicitly to show you 
how it is working in this case. Equation 2.66 can be returned to vector notation by 
defining the Laplacian operator V? as 


— — g/d a a a 
V=V-V= ( = ( se + ®, 2.67 
Ox} Ox; ) O*x, 07 x2 07x34 ) ( ) 


so that 


V- (Vo) = Vo. (2.68) 


Example 2.2 Consider the expression V X V x V, which is the curl of the curl 
of V. This quantity arises when developing the electromagnetic wave equation from 
Maxwell’s equations. To write this in subscript notation in a Cartesian coordinate 
system, we need to use two Levi-Civita symbols: 


eee : 
VxVxV= fee sErsj Oxi (2.69) 


Equation 2.69 can be manipulated as follows: 


ee ey 27 ao 
VxVxKVe= Fo (oe teenie 


_@ OV; . 

Dx (S ) erent; 
_ 8 (NV; 
rales 


Mt 


i 


)& (88% ~ 343s) 


34 DIFFERENTIAL AND INTEGRAL OPERATIONS 


_ ¢@ (“ye _ to (Ste 
8x; \dx,) * ax; \ ax) * 


(ZS) 2 x (2), (2.70) 


> 


It 


Finally, the right-hand side of Equation 2.70 is converted back into vector notation 
to obtain the result 
VxVxV=V(V-V) —V’V. (2.71) 


Notice that the Laplacian operator can act on both vector and scalar fields. In Equation 
2.68, the Laplacian operates on a scalar field to product a scalar. In Equation 2.71, it 
operates on a vector field to produce a vector. 


2.4 INTEGRAL DEFINITIONS OF THE DIFFERENTIAL OPERATORS 


In Equations 2.25, 2.27, and 2.29, we provided expressions for calculating the gradi- 
ent, divergence, and curl. Each of these relations is valid only in a Cartesian coordinate 
system, and all are in terms of spatial derivatives of the field. Integral definitions of 
each of the differential operators also exist. We already derived one such definition 
for the curl in Equation 2.63. In this section, we present similar definitions for the 
gradient and divergence, as well as an alternate definition for the curl. Their deriva- 
tions, which are all similar to the derivation of Equation 2.63, are in most introductory 
calculus texts. We present only the results here. 
The gradient of a scalar field at a particular point can be generated from 


tall (2.72) 


where V is a volume that includes the point of interest, and S is the closed surface 
that surrounds V. Both S and V must be shrunk to infinitesimal size for this equation 
to hold. 

To get the divergence of a vector field at a point, we integrate the vector field over 
an infinitesimal surface S that encloses that point, and divide by the infinitesimal 
volume: 


Se da:A 
ViaSat (2.73) 
svso fydt 
We already derived the integral definition 
lim ¢ dr-V= (V XV) - lim | d@, (2.74) 


which generates the curl. This definition is a bit clumsy because it requires the 
calculation of three different integrals, each with a different orientation of S, to get 
all three components of the curl. The following integral definition does not have this 
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problem, but it uses an uncommon form of the surface integral: 


(2.75) 


2.5 THE THEOREMS 


The differential operators give us information about the variation of vector and 
scalar fields on an infinitesimal scale. To apply them on a macroscopic scale, we 
need to introduce four important theorems. Gauss’s Theorem, Green’s Theorem, 
Stokes’s Theorem, and Helmholtz’s Theorem can be derived directly from the integral 
definitions of the differential operators. We give special attention to the proof and 
discussion of Helmholtz’s Theorem because it is not covered adequately in many 
texts. 


2.5.1 Gauss’s Theorem 


Gauss’s Theorem is derived from Equation 2.73, written in a slightly different form: 
V -Adr = lim pao A. (2.76) 
50 Js 


In this equation, the closed surface S completely surrounds the volume dt, which we 
have written as an infinitesimal. 

Equation 2.76 can be applied to two adjacent differential volumes d7, and d7, that 
have a common surface, as shown in Figure 2.11: 


V-Adn+V-Adn=$ do-A+ J do’ (2.77) 
S, So 


The contributions to the surface integral from the common surfaces cancel out, as 
depicted in the figure, so Equation 2.77 can be written as 


V -Kdr,+¥-Adr -4 do@-K, (2.78) 


Si+2 


dt, a, 


A-do, +A-do, =0 
Figure 2.11 The Sum of Two Differential Volumes 
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Figure 2.12 The Sum of Two Differential Volumes 


where S;+2 is the outside surface enclosing both dt, and dt, as shown in Figure 
2.12. We can continue this process of adding up contiguous differential volume to 
form an arbitrary finite volume V enclosed by a closed surface S. The result, 


[av R= gao-a, (2.79) 
Vv AY 


is called Gauss’s Theorem. 


2.5.2 Green’s Theorem 
Green’s Theorem takes two forms and follows directly from Gauss’s Theorem and 
some vector operator manipulations. Start by considering the expression V - (uV v), 
where u and v are both scalar fields. Using a vector operator identity, which you will 
prove in one of this chapter’s exercises, this expression can be rewritten as 

V-(uVv) = Vu- Vv + Vv. (2.80) 
Switching u and v gives 

V - (vVu) = Vu -Vut oV7u. (2.81) 
When Equation 2.81 is subtracted from Equation 2.80, the result is 

v- (uVv) -V- (vVu) = uV2v —vV7u. (2.82) 


Finally, we integrate both sides of Equation 2.82 over a volume V and apply Gauss’s 
Theorem, to generate one form of Green’s Theorem: 


fas -(uVv — vVu) = [ature - vV7ul. (2.83) 
s v 


In this expression, the closed surface S surrounds the volume V. The same process 
applied directly to Equation 2.80 produces a second form of Green’s Theorem: 


$ do (uVv) = / dr[Vu- Vu +uV7vI. (2.84) 
S Vv 
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2.5.3 Stokes’s Theorem 


Stokes’s Theorem derives from Equation 2.74, 


(V XA) -d@ = lim ¢ aF-A, (2.85) 
c0 Je 
where the path C encloses the differential surface d@ in the right-hand sense. 
The development of Stokes’s theorem follows steps similar to those for Gauss’s 
theorem. Equation 2.85 is applied to two adjacent differential surfaces that have a 
common border, as shown in Figure 2.13. The result is 


(Vx A) ao, + (VA) do, = $ dr-A (2.86) 
Ci+2 

where the path C;+2 is the closed path surrounding both do; and doz. The line 
integrals along the common edge of C, and C2 cancel exactly. Any number of these 
differential areas can be added up to give an arbitrary surface S and the closed contour 
C which surrounds S. The result is Stokes’s Theorem: 


jaa (V XA) = fae-®. (2.87) 
S Cc 


There is an important consequence of Stokes’s Theorem for vector fields that have 
zero curl. Such a field can always be derived from a scalar potential. That is to say, if 
V XA = Oeverywhere, then there exists a scalar function ®(F) such that A = —V®. 
To see this, consider the two points 1 and 2 and two arbitrary paths between them, 
Path A and Path B, as shown in Figure 2.14. A closed line integral can be formed 
by combining Path A and the reversal of path B. If V x A = 0 everywhere, then 
Equation 2.87 lets us write 


[ar a- [ar-&= far-K- i (2.88) 
A B 


Figure 2.13 The Sum of Two Differential Surfaces 
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Point 2 
Path A 


Path B 
Point 1 


Figure 2.14 Stokes’s Theorem implies a Scalar Potential 


or 


jarx= parx (2.89) 
A B 


Equation 2.89 says the line integral of A between the two points is independent of 
the path taken. This means that it is possible to define a scalar function of position 
®(r) such that its total differential is given by 


d® = —d¥-A. (2.90) 


It is conventional to add the negative sign here so ® increases as you move against 
the field lines of A. Inserting Equation 2.90 into the line integrals of Equation 2.89 
shows that these integrals are both equal to 


2 
j —d® = @(1) — @(2). (2.91) 
1 


Referring back to Equation 2.33, it is clear that the condition of Equation 2.90 can 
be rewritten as 


A= -V®. (2.92) 


In summary, if the curl of a vector field is everywhere zero, the field is deriv- 
able from a scalar potential. The line integral of this type of vector field is always 
independent of the path taken. Fields of this type are often called conservative. 


2.5.4 Helmholtz’s Theorem 
Helmholtz’s Theorem states: 
A vector field, if it exists, is uniquely determined by specifying its divergence and curl 


everywhere within a region and its normal component on the closed surface surrounding 
that region. 
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There are two important parts of this statement. On one hand, it says if we have a 
field V that we are trying to determine, and we know the values of V - V and V X V 
at every point in some volume, plus the normal component of V on the surface of 
that volume, there is only one V that will make all this work. On the other hand, we 
have the disclaimer, “if it exists.” This qualification is necessary because it is entirely 
possible to specify values for the divergence, curl, and normal component of a vector 
field that cannot be satisfied by any field. 

To prove Helmholtz’s Theorem we will assume the two vector fields V, and V2 
have the same values for the divergence, curl, and normal component. Then we show 
that if this is the case, the two solutions must be equal. Let W = V, ~ V2. Since 
the divergence, curl, and the dot product are all linear operators, W must have the 
following properties: 


V-W=0 _ intheregion 
Vxw=o in the region (2.93) 
f-W=0 on the surface. 


Because V X W = 0, Wcan be derived from a scalar potential 


= -V®. (2.94) 


Green’s Theorem, in the form of Equation 2.84, is now applied with u = v = ® to 
give 


pas -O(V®) = [ a7|o¥ -(Vb) + V®-VO). (2.95) 
Js v 
Inserting Equation 2.94 into Equation 2.95 gives 
pas OW = [av WwW), (2.96) 
Js Vv 


Using Equations 2.93, the surface integral on the LHS and the volume integral of 
®V - Won the RHS are zero, with the result that 


[ dtW-W= i, d7iw\* = 0. (2.97) 
y 


y 


Because |W]? is always a positive quantity, the only way to satisfy Equation 2.97 is 
to have W = 0 everywhere. Consequently, V; = V2 and Helmholtz’s Theorem is 
proven. 

Helmholtz’s Theorem is useful for separating vectors fields into two parts, one 
with zero curl, and one with zero divergence. This discussion relies on two identities 


V-(V xA)=0 (2.98) 
Vx Vo =0, (2.99) 
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which you will prove at the end of the chapter. Write Vas 


V=VxA-V®. (2.100) 
Then we can write: 
V-V=—-V’o 
VxV=aVxVxA. 
a-V=a-(V xA-V®9), (2.101) 


Since the divergence, curl, and normal component are all specified if A and ® 
are specified, Helmholtz’s Theorem says V is also uniquely specified. Notice the 
contribution to V that comes from A has no divergence since V - (V X A) = 0. 
This is called the rotational or solenoidal part of the field and A is called the vector 
potential. The portion of V which arises from ® has no curl, because V xVo =0. 
This is called the irrotational part of the field and ® is called the scalar potential. 


EXERCISES FOR CHAPTER 2 
1. Consider the two-dimensional scalar potential function 
P(x, y) = x9 — By7x. 


(a) Make a plot of the equipotential contours for three positive and three negative 
values for ®, 


(b) Find V®. 

(c) Show that the V® field lines are given by setting 3x2y — y? equal toa series 
of constants. 

(d) Plot six representative V® field lines. Be sure to indicate the direction of 
the field and comment on its magnitude. 

(e) Find V - V9, the divergence of the V® vector field, and show that your 
field lines of part (d) agree with this divergence. 


2. An electric dipole p is located at the origin of a Cartesian system. This dipole 
creates an electric potential field @(F) given by 


pr 
Or) = ——— 
© 47e,|r[3 
where F is the position vector, F = x;é;. Let p = pé3. 


(a) Sketch the equipotential lines, ® = constant. 
(b) Find the electric field, E = —V®. 
(c) Sketch the electric field lines. 
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3. Perform the line integral 


fp atxy, 
(és 


where C is the contour shown below 


and 
(a) V = V,@,. 
(b) V = V,8@.. 
(c) V= Yok 8, + Voy _6 


x+y fx y? 9? 


4. Use subscript/summation notation to verify the following identities: 
(a) V- (fA) = f(V-A) +A-VF. 
(b) VXVxXA=V(¥-A)-V7A. 
(c) Vx (fA) = f(V XA) + VF XA 
(a) V- (AX B) =B-(V xA) —A-(V XB). 
(e) Ax (V XB) = (VB) -A- (A-V)B. 
(f) Vifg) = fVg + eV. 
(g) V(A-B) = Ax (V XB) + Bx (V XA) + (A-V)B+ (B-V)A. 
(th) VX VP =0. 
i vV- (V x A) =0. 
5. Calculate the work done by following a straight line path from the Cartesian 
point (1, 1) to (3, 3), if the force exerted is given by 


F = (x — y)& + (x + ye. 


Can this force be derived from a scalar potential? Pick any other path that goes 
from (1, 1) to (3, 3) and calculate the work done. 
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10. 
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. Let a vector field V be given by 


V = xé, + yé. 


(a) Use Stokes’s Theorem to evaluate the line integral, 


par-v, 
Cc 


where C is the unit circle in the xy-plane centered at (0, 0). 

(b) Use Stokes’s Theorem to determine the same line integral if C is a unit circle 
centered at (1,0). 

(c) Evaluate the two integrals above by actually performing the line integrals 
themselves. 


. Show that if B = V x A, then for any closed surface S, 


paa-B =o. 
AY 


. Use the differential operator theorems to evaluate 


paa-v, 
KY 


where S is the surface of a sphere of radius R centered at the origin, and V is 
(a) V = xé, + ye, + zé,. 

(b) V = x7@, + yé, + 2é,. 

(c) V = x8, + y@, + P%,. 


. Use an appropriate vector theorem to evaluate 


far-¥, 
Cc 


V = (2y? — 3x’y)é, + (Ary — 8). 


where C is any closed path and 


A force field is given by the relationship 
F = (2x + 2y)é + (2y + 2x)e,. 
(a) What is V - F? 


(b) What is V x F? 
(c) Sketch the force field lines. 


EXERCISES 43 


(d) This is a conservative field. What is its potential? 
(e) Sketch the equipotentials. 


11. The continuity equation for a fluid is 


V-J+ 


2|S 
T 
P 


where J = pV. 
(a) Show Vv - V = Oif the fluid is incompressible (constant density). 
(b) Apply Gauss’s Theorem to the continuity equation and interpret the result. 


12. Prove these integral forms of the differential operators: 


= ®do 
@re= tees. 
sv0 fdr 
ee do-A 
ve s 
b) VA an, fyat 
ze f,d0 XA 
(c) VX A= lim 
0 ty dt 
13. Maxwell’s equations in vacuum are 
gen stake oB 
VxE=-— 
ot 
=e JE 
Ho€o at 
V-E=0 
Vv -B=0. 
Manipulate these using subscript/summation notation to obtain the wave equation 
in vacuum: 
= 7E 
VE = 6. 


ot 
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CURVILINEAR COORDINATE 
SYSTEMS 


Up to this point, our discussions of vector, differential, and integral operations have 
been limited to Cartesian coordinate systems. While conceptually simple, these sys- 
tems often fail to utilize the natural symmetry of certain problems. Consider the 
electric field vector created by a point charge gq, located at the origin of a Cartesian 
system. Using Cartesian basis vectors, this field is 
— gq xe, + yé, + 2, 

Amey (x2 + y? + 2283/2" 
In contrast, a spherical system, described by the coordinates (7, 8, ), fully exploits 
the symmetry of this field and simplifies Equation 3.1 to 


(3.1) 


— q é, 
ents, G2) 


The spherical system belongs to the class of curvilinear coordinate systems. Basis 
vectors of a curvilinear system are orthonormal, just like those of Cartesian systems, 
but their directions can be functions of position. 

This chapter generalizes the concepts of the previous chapters to include curvi- 
linear coordinate systems. The two most common systems, spherical and cylindrical, 
are described first, in order to provide a framework for the more abstract discussion 
of generalized curvilinear coordinates that follows. 


3.1 THE POSITION VECTOR 


The position vector r(P) associated with a point P describes the offset of P from the 
origin of the coordinate system. It has a magnitude equal to the distance from the 
origin to P, and a direction that points from the origin to P. 
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> 


1 F(P) 


Figure 3.1 The Position Vector 


It seems natural to draw the position vector between the origin and P, as shown 
in Figure 3.1(a). While this is fine for Cartesian coordinate systems, it can lead 
to difficulties in curvilinear systems. The problems arise because of the position 
dependence of the curvilinear basis vectors. When we draw any vector, we must 
always be careful to specify where it is located. If we did not do this, it would not 
be clear how to expand the vector in terms of the basis vectors. To get around this 
difficulty, both the vector and the basis vector should be drawn emanating from the 
point in question. The curvilinear vector components are then easily obtained by 
projecting the vector onto the basis vectors at that point. Consequently, to determine 
the components of the position vector, it is better to draw it, as weil as the basis 
vectors, emanating from P. This is shown in Figure 3.1(b). There are situations, 
however, when it is better to draw the position vector emanating from the origin. For 
example, line integrals, such as the one shown in Figure 2.2, are best described in this 
way, because then the tip of the position vector follows the path of the integration. 
We will place the position vector as shown in Figure 3.1(a) or (b), depending upon 
which is most appropriate for the given situation. 

In Cartesian coordinates, the expression for the position vector is intuitive and 
simple: 


r= rj; ant xj8;. (3.3) 
The components (7;,72,73) are easily identified as the Cartesian coordinates 
(x1, ¥2,%3). Formally, r; is obtained by dot multiplying the position vector r by 
the basis vector @,: 


rj =@,-Fr = x). (3.4) 


While this may seem overly pedantic here, this technique can be used to find the 
vector components for any vector in any orthogonal coordinate system. 


3.2 THE CYLINDRICAL SYSTEM 


The coordinates of a point P described in a cylindrical system are (p, ¢,z). The 
equations 
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x = pcosd 
y= psingd (3.5) 
Z=2z 


and the corresponding inverse equations 


paris 
@ = tan”! (y/x) (3.6) 
Y ian é 


govern the relationship between cylindrical coordinates and the coordinates of a 
superimposed Cartesian system, as shown in Figure 3.2(a). 

The unit basis vectors for the cylindrical system are shown in Figure 3.2(b). Each 
basis vector points in the direction that P moves when the corresponding coordinate 
is increased. For example, the direction of é, is found by watching how P moves as 
p is increased. This method can be used to determine the directions of basis vectors 
for any set of coordinates. Unlike the Cartesian system, the cylindrical basis vectors 
are not fixed. As the point P moves, the directions of é, and éy both change. Also 
notice that if P lies exactly on the z-axis, i.e., p = 0, the directions of é, and é4 are 
undefined. 

The cylindrical coordinates, taken in the order (p, @, z), form a right-handed sys- 
tem. If you align your right hand along é,, and then curl your fingers to point in the 
direction of €,, your thumb will point in the @, direction. The basis vectors are also 
orthonormal since 


& -@, =@,°@, = @,-&, = 0 
é, -é = é, -é, = @,-é, = 1. (3.7) 


The position vector expressed in cylindrical coordinates is 


r= (F-6)€ + (F- ég)éy + (F- ee. (3.8) 
z 6, 
: S é, 
7 Pp  —?~p | 
é 
. > y _ DN : . y 
x x 
(a) (b) 


Figure 3,2 The Cylindrical System 
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x ‘ p 


Figure 3.3 The Position Vector in a Cylindrical System 


Notice that éy is always perpendicular to F, as shown in Figure 3.3, so Equation 3.8 
reduces to 


TF = r6) + rb. (3.9) 


The two-dimensional version of the cylindrical system, with only the (p, @) coor- 
dinates, is called a polar system. This system, shown in Figure 3.4(a), has basis vectors 
é, and é4. The position vector, shown in Figure 3.4(b), has only a p-component and 
is expressed as 


T = pe. (3.10) 


Remember an arbitrary vector V, unlike the position vector, can have both p- and $- 
components, as shown in Figure 3.5. 


y é, p 
ee 
Sp 
x Pea ‘ : x 
(a) (b) 


Figure 3.4 The Polar System 


Figure 3.5 Polar Components of a Vector 
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3.3 THE SPHERICAL SYSTEM 


The three coordinates (7,0, @) describe a point in a spherical coordinate system. 
Their relationship to a set of Cartesian coordinates is shown in Figure 3.6(a). The 
equations 


x =rsin@cos@ 
y=rsinOsind (3.11) 
z=rcos0 


and the inverses 


r=Vxerty2 +22 
¢ = cos! (<a) (3.12) 
aA 
b= cos! 
Vx? + y2 + 22 


allow conversion between the coordinates of the two systems. 

The unit basis vectors for the spherical system are shown in Figure 3.6(b). As with 
cylindrical coordinates, we obtain the direction of each basis vector by increasing 
the associated coordinate and watching how P moves. Notice how the basis vectors 
change with the position of the point P. If P lies on the z-axis the directions for €g 
and 4 are not defined. If P lies at the origin é, is also not defined. 

The spherical system, with the coordinates in the order (7, 8, d), is right-handed, 
just as in both the Cartesian and cylindrical systems. It is also an orthonormal system 
because 


é, -&y = 6, 8, = by 6, = 0 
é, é, = & # & = éy éy = |. (3.13) 
Z le as 
| e. 
P é 
| ® 
“A & 
i isl 
T: i x 
b Boe 
Si a 
x x 
(a) (b) 


Figure 3.6 The Spherical System 
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“3 é, r 
r 
4 é 
e rQ 
8 é, 
y @ 


x 


Figure 3.7 The Position Vector in Spherical Coordinates 


The position vector, shown in Figure 3.7, is expressed in the spherical system as 
F = (F- 6,6, + (F- Eg )€o + (TF: by Ey. (3.14) 
Because F is always perpendicular to ég and éy, Equation 3.14 simplifies to 


F = ré,. (3.15) 


3.4 GENERAL CURVILINEAR SYSTEMS 


Although the most common, cylindrical and spherical coordinate systems are just 
two examples of the larger family of curvilinear systems. A system is classified as 
curvilinear if it has orthonormal, but not necessarily constant, basis vectors. Other 
more esoteric curvilinear systems include the toroidal, hyperbolic, and elliptical 
systems. Instead of individually working out the vector operations of the previous 
chapter for each of these systems, we present a general approach that can tackle any 
curvilinear geometry. 


3.4.1 Coordinates, Basis Vectors, and Scale Factors 


The coordinates (4), g2, q3) and corresponding basis vectors (G1, G2, G3) will be used 
to represent any generic curvilinear system, as shown in Figure 3.8. Because these 


Z 4, 
4, 
P(qi,42,43) 


4; 


xX 


Figure 3.8 Curvilinear Coordinates and Basis Vectors 
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basis vectors are functions of position, we should always be careful to draw them 
emanating from a particular point, as we mentioned earlier in this chapter. 

In both the cylindrical and spherical coordinate systems, a set of equations existed 
which related these coordinates to a “standard” set of Cartesian coordinates. For the 
general case, we write these equations as 


Xi = Xi(Q1, 92,93) (3.16) 
qi = Qi(%1, X2, X3), (3.17) 


where the subscript notation has crept in to keep things concise. In both these equa- 
tions, the subscript i takes on the values (1, 2,3). The variables x; always represent 
Cartesian coordinates, while the g; are general curvilinear coordinates. 

An expression for q;, the unit basis vector associated with the coordinate g;, can 
be constructed by increasing q;, watching how the position vector changes, and then 
normalizing: 


q, = Flu, G.18) 
i 

where h; = |@F/0q;|. This equation is a little confusing, because there actually is 
no sum over the i index on the RHS, even though it appears twice. This is subtly 
implied by the notation, because there is an i subscript on the LHS. The h;, which 
are sometimes called scale factors, force the basis vectors to have unit length. They 
can be written in terms of the curvilinear coordinates. To see this, write the position 
vector in terms of its Cartesian components, which in turn are written as functions of 
the curvilinear coordinates: 


F = x;(91,92,93)€;- (3.19) 
Therefore, 
r é. j ? ’ aA 
OF _ 9xj(41,92 W)q, (3.20) 
04: 0qi 
and 


OF Oxy \* _ (dx\" _ (dx3\" 
i, = =| Z (2) e (=) + (2) (3.21) 
04; 0g: Ogi 04: 

The physical interpretation of the scale factors is quite simple. For a change dq, 
of the coordinate q;, the position vector changes by a distance of |dq,h,|. Therefore, 
using Equation 3.18, the displacement vector can be written in the curvilinear system 
as 


= dqihi(qi. 92, 93)4i, (3.22) 
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where there now is a sum over the index / on the RHS because there is no subscript on 
the LHS. Since the h; factors can change with position, a differential volume element 
in a curvilinear system is not necessarily a square cube as it is in Cartesian systems. 
Instead, as discussed in the next section, the sides of the differential volume in a 
curvilinear system vary in length and can pick up curvature. 


3.4.2 Differential Geometry 


Figure 3.9 depicts a differential surface enclosing an infinitesimal volume in a curvi- 
linear system. This figure will be the basis for the derivation, in general curvilinear 
coordinates, of the divergence and curl, as well as surface and volume integrals. 

The volume is formed by choosing a starting point (q;, q2, g3) and then constructing 
seven other vertices by moving away from this point with small changes of coordinates 
dqi, dq2, or dq3. In the differential limit, the lengths along each edge of this “warped 
cube” are given by the appropriate dq; times its scale factor. The scale factor is 
evaluated at a set of coordinates that corresponds to their initial value on the edge. If 
the coordinate q; stays at q; all along an edge, it is set equal to q;. If the coordinate q; 
is equal to g; + dq; all along an edge, it is set equal to q; + dq;. If the coordinate q; 


(q,,42,43+dqs) 


hs(q,,.92,q3)dq, 


(q,,42,q3) 


Figure 3.9 Differential Volume of a Curvilinear Coordinate System 
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goes from gq; to g; + dq; on an edge, it is set equal to g;. This is a somewhat cavalier 
way to treat the position dependence of these factors, although it does give all the 
correct results for our derivations. A more rigorous approach, which evaluates the 
mean value of the scale factors on each edge, is presented in Appendix B in a detailed 
derivation of the curvilinear curl. 

Following this approach, the differential element’s volume is simply 


, (3.23) 
(41.92.93) 


dt = dqydgzdg3hyhzh3 


where the h;’s are all evaluated at the point (q1, q2, q3). The differential surface of the 
bottom shaded side is 


‘ (3.24) 
(91.92.93) 


dG botom = ~dqydqzhyh243 


where the minus sign occurs because the surface normal is antiparallel to q3. In 
contrast, the differential surface of the top shaded side is 


(3.25) 


AG op = dqydqzhy hrs : 
(41.92.43 + dq3) 


The minus sign is absent because now the surface normal is parallel to @3. In this 
case, h,, hz, and the basis vector G3 are evaluated at the point (q1, q2,q3 + dq3). 


3.4.3 The Displacement Vector 


The displacement vector dF plays a central role in the mathematics of curvilinear 
systems. Once the form of df is known, equations for most of the vector operations 
can be easily determined. From multivariable, differential calculus, dF can be written 


dt = —dq;. (3.26) 
As we showed in Equation 3.22, this can be written using the scale factors as 
df = dqihi4. (3.27) 


In a Cartesian system g; = x;, Gj; = 6;, and h; = 1, so Equation 3.27 becomes the 
familiar 


d¥ = dx;@;. (3.28) 
In cylindrical coordinates, hy = hp = 1, hy = hy = p, and h3 = h, = 1 so 


dr = dq\q, + pdqoQ2 + dq34s 
= dpb, + pda, + dze,. (3.29) 
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3.4.4 Vector Products 
Because curvilinear systems are orthonormal, we have 
qi 4; = 5,- (3.30) 


This means that the dot product of two vectors, A and B, has the same form as in a 
Cartesian system: 


A B B = Aiqi : BjGj = A;B jj; = A;B;. (3.31) 


Here A; and B; are the curvilinear components of the vectors, which can be obtained 
by taking axis parallel projections of the vectors onto the basis vectors: 


A; = A> q&.- (3.32) 
With proper ordering, we can always arrange our three curvilinear coordinates to 
be right-handed. Thus, the form of the cross product is also the same as in a Cartesian 


system. The cross product of A and B expressed using the Levi-Civita symbol is 


A x B = AiG; x B,q; = A;B jQxEi jx- (3.33) 


3.4.5 The Line Integral 


Using the expression for the displacement vector in Equation 3.27, line integrals in 
curvilinear systems are straightforward: 


JC Cc 


There is a sum over both i and j on the RHS of this equation. Because the curvilinear 
basis vectors are orthonormal, this line integral becomes 


parv= [ aatvs (3.35) 
Cc Cc 


3.4.6 The Surface Integral 


Curvilinear surface integrals are a bit more complicated, because the orientations of 
the surfaces must be considered. Recalling Figure 3.9, and Equations 3.24 and 3.25, 
the surface integral of a vector V is 


jae y V = J 24adaahihaYs = dqzodq3hzh3V, a dq, dq3h\h3V2, (3.36) 
Ss BY 


where each plus or minus sign must be chosen depending on the sign of d@ - q;. 
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3.4.7 The Volume Integral 


The geometry of Figure 3.9 can be used to derive the form of volume integrals in 
curvilinear systems. The volume of the element, in the infinitesimal limit, is simply 
dt = dq\dqzdq3hh2h3. Therefore the integral of a function p(r) over any volume V 
is expressed as 


ja pr) = J dardaadashibahsplai.42 4). (3.37) 
Vv Vv 


3.4.8 The Gradient 
In Chapter 2, we showed how the gradient of a scalar function ® is defined such that 


d® =V® -dr. (3.38) 
Substitution of Equation 3.27 for d¥ yields 
d® = V® - dqjh;4qj. (3.39) 
Differential calculus implies d® = (0® /dq;)dq;, so 


a® = 2 


The only way Equation 3.40 can be satisfied is if 


= 1 (a® 
= — ( — )@. Al 
se (ea (3.41) 


The brackets are necessary in Equation 3.41 to keep the 0/dq; operator from acting 
on the basis vectors, because in general 04; /0q; # 0. 


3.4.9 The Divergence 


The divergence operation in a curvilinear system is more complicated than the gra- 
dient, and must be obtained from its integral definition 


(3.42) 


where S is the closed surface that encloses the vanishingly small volume V. 
Once again, consider the differential volume of Figure 3.9. The denominator of 
Equation 3.42 for this volume in the infinitesimal limit is straightforward: 


far = dq, dqzdq3h,hzh3. (3.43) 
v 
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To evaluate the numerator, integration over all six surfaces of V must be performed. 
First, consider the two shaded sides of Figure 3.9, with normals aligned either parallel 
or antiparallel to q3. The integral over the “bottom” surface is 


| d@- A = —dqidqx(hihnA3) (3.44) 
bottom 


(41.42.43) 
The minus sign arises because on this surface d@ and qj; are antiparallel. Also notice 
that A3, A), and hp are all functions of the curvilinear coordinates and are evaluated 
at (41, 92, 43), the initial values of the coordinates on this surface. The integral over 
the “top” surface is 


(3.45) 


/ d@ -A = +dqdqn(hyhnA3) 
top (41.92.93 + dq3) 


In this case there is no minus sign because this surface normal is oriented parallel 
to 43. The initial value of the q3 coordinate for this surface has changed by an amount 
dq3 as compared to the bottom surface and thus A3, 4, and 2 must all be evaluated 
at the point (4), g2,q3 + dq3). In the differential limit 


Oh h2A 
(ayh2As) =(ihzAy| + SBE) ag, 8.46) 
(41.42.43 +4q3) (41.42.43) 4q3 
(41,9293) 
so the sum of Equations 3.44 and 3.45 is 
— hihnA 
| d@-A= RRA) 5 dada: (3.47) 
both 943 


Combining this result with similar integrations over the remaining four surfaces gives 


[aa-a= | a 4 Ahrh3A2) , Whih2As) 
S oq) 042 443 


| dqidqzdq3. (3.48) 


Substituting Equations 3.43 and 3.48 into Equation 3.42 gives the result 


1 eens i 0(hyh3A2) a sed 
041 0q2 0q3 | 


(3.49) 


3.4.10 The Curl 
The curl operation for a curvilinear coordinate system can also be derived from its 
integral definition: 
V xA-lim | d@= lim ¢ dr-A, (3.50) 


S50 Js C0 Jc 


where C is a closed path surrounding the surface S, and the direction of d@ is defined 
via C and a right-hand convention. 
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1 
Figure 3.10 Orientation of Surface for Curvilinear Curl Integration 
A single component of the curl can be picked out by orienting do@ along the 
direction of a basis vector. Consider Figure 3.10, where d@ is oriented to pick out the 


q: component. In this case, d@& = hydqzh3dq34,, so the left side of Equation 3.50 in 
the differential limit becomes 


VxA- lim | do = dandgshyhs Gy - [V XA]. (3.51) 
s—0 S » mean amen 
[Wx], 


The line integral on the right side of Equation 3.50 naturally divides into four parts 
along C,, C,, C,, and Cy, as shown in Figure 3.11. The complete integral is then 
given by 


gar-K= | dgshaA, + dgxhata + darhoA, + | dq3h3A3. (3.52) 
Cc Ca Cc, C. Ca 


Noid 


3 ee 
- 


(41,42,43+dq;3) 


(41.4243) % wh 
l 


Figure 3.11 Differential Geometry for Curvilinear Curl Integrations 
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In the differential limit, the integral along C, evaluates to 


dq2, (3.53) 


| dqyhnAz = (hrA2) 
Ca (41,92,93) 


where we evaluate both A2 and hj at the point (q1, 42, q3). Likewise, the contribution 
along C;, is 


dq, (3.54) 


(41,42,93 + dq3) 


| dqzhzAz = —(hzA2) 
C. 


where now we evaluate Az and hp at (g1, g2,q3 + dq3). In the differential limit, 


pe A(h2A2) 


(h2A2) 
2A das 


= (hyA) 
(41.42.43 +dq3) 


dqs, (3.55) 


(41.92.93) (41592593) 


which allows the integrals along C, and C, to be combined to give 


= h 
| goog AD) 
Cat Ce 4q3 


dq2dq3. (3.56) 
(41.9243) 


Similar integrations can be performed along C, and Cz. The combination of all four 

pieces yields 

A(h3A3) — A(h2A2) 
dq2 oq3 


Substituting Equations 3.57 and 3.51 into Equation 3.50 gives the 1-component of 
the curl of A: 


lim ¢ A-dt= 


d ; 3.57 
Lae q2dq3 (3.57) 


Wxa],-2! [sae eat (3.58) 


1 Iyhs | 0q2 0q3 


The other components of V x A can be obtained by reorientating the surface shown 
in Figure 3.10. The results are 


-_ 1 fatyA) a) 

VxA = 3.59 

IM 42 hyhy | 0q3 oq, 
a 1 [d(hzA2) ae 

Vv xAl,=— ae. 3.60 

[Vv “3 hyh oq oq2 


Equations 3.58—3.60 can be written more compactly using a determinant, 


1 yg: M@ ha 
8/dq, a/dq, 93/0q3 ; G.61) 
WAG teAd. (tah 


det 
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or using the Levi-Civita symbol and subscript notation, 


qi. (3.62) 


3.5 THE GRADIENT, DIVERGENCE, AND CURL IN CYLINDRICAL 
AND SPHERICAL SYSTEMS 


3.5.1 Cylindrical Operations 
In the cylindrical system h, = h, = 1, hy = hg = p,andh; = h, = 1. The gradient, 


divergence, and curl become 


V® = 


ee nee. a® , a® 4. (3.63) 


peace + — 
p p ag ** 0 
7. = 190A) | 144g , Az 
p op p op ed 


(3.64) 


1 [a(pAg) a4 : 
+= — —!14q, 3.65 
p | op ab | 4 ce 


3.5.2 The Spherical Operations 


In the spherical system h, = h, = 1, hy = hg = r, andh; = hg = rsin@. The 
gradient, divergence, and curl become 


oP, 10®, 1 o@, 


Vm = —q, + + : 
2 ov" + 990? * Fsind ab ‘Is Ce) 
= ~ 1 arA,) 1 asin 0A@) 1 dAg 
V-A= + + 3.67 
r2 or rsin@ 00 rsin@ od Geo) 
aes 1 [a(sin@Ag) Ae] . 
VxA= ‘ 
: r sin 6 | a6 ap 
1f 1 0A, — a(rAg)] . 
r |sin@ dd or 46 


1 [O(rAg) OA, |. 
+ — —- — ‘ 3.68 
r or 0g | ds (3-68) 
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EXERCISES FOR CHAPTER 3 


1. 


» 


A vector expressed in a Cartesian system has the following form: 
V =6, +6, + @,. 


(a) If this vector exists at the Cartesian point (1,2, 1), what are its components 
expressed in a cylindrical system? 

(b) If this vector exists at the Cartesian point (1,2, 1), what are its components 
expressed in a spherical system? 

(c) If this vector exists at the Cartesian point (0, 0, 1), what are its components 
expressed in a cylindrical system? 

(d) If this vector exists at the Cartesian point (0,0, 1), what are its components 
expressed in a spherical system? 


. In a cylindrical system, with coordinates (p, @, z), the vector V; exists at the 


point (1,0, 0) and is given by Vi = €, + €4. A second vector V> exists in the 
same coordinate system at the point (1, 1/2, 0) and is given by V2 = é + ey. 
What is the value of V; - V2? 


A circular disk, centered at the origin of a coordinate system, is rotating at a 
constant angular velocity w,. 


\e 


(a) In Cartesian coordinates, what is the velocity vector of the point P located 
at the Cartesian coordinates (1, 1)? 

(b) In polar coordinates, what is the velocity vector of the point P located at the 
Cartesian coordinates (1, 1)? 


. Perform the integral 


where 


F = 2 sin@ sind é, + 3 cos@ cosh é, + 4 cos @ cos oh &, 
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and § is one-eighth of a spherical shell of radius 2, centered at the origin of 
the spherical system. The integration region lies in the positive xyz-octant of a 
Cartesian system, as shown below: 


5. Perform the line integral 


[ate 
Cc 


where ég is a spherical basis vector, and C is the quarter circle of unit radius lying 
in one quadrant of the Cartesian xz-plane shown below. Evaluate this integral in 
three ways, using Cartesian, cylindrical, and spherical coordinate systems. What 
happens to the value of the integral if the direction along C is reversed? 


6. Perform the following line integrals along one quarter of a unit radius circle, 
traversed in a counterclockwise direction. The center of the circle is at the origin 
of a Cartesian coordinate system, and the quarter circle of interest is in the first 
xy-quadrant. 

(a) {.d¥- V, where V = 2x é,. 
(b) {.d¥ ®, where ® = 2x. 
(c) Repeat parts (a) and (b) using a polar coordinate system. 


7. Determine V - r, where F is the position vector 
r=xé,+yé, + 276, 


and show that V - F = 3. Repeat this calculation using spherical coordinates. 
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8. Derive the h;,’s for the cylindrical and spherical systems. Also derive the expres- 
sions given in the chapter for the gradient, divergence, and curl operators in these 
systems. 


9. Redo Exercise 8 of Chapter 2 using spherical coordinates. 


10. A Tokamak fusion device has a geometry that takes the shape of a doughnut 
or torus. Calculations for such a device are sometimes done with the toroidal 
coordinates shown in the figure below. 


|———-_ Major Axis 


a TAs mee 


Minor Axis ~ —— 


The “major axis” of the device is a vertical, straight line that forms the toroidal 
axis of symmetry. The “minor axis” is a circle of fixed radius R, that passes 
through the center of the doughnut. The position of a point is described by the 
coordinates (r, 6, @). The coordinates r and @ are similar to a two-dimensional 
polar system, aligned perpendicular to the minor axis. The coordinate @ measures 
the angular position along the minor axis. 


(a) Make a sketch of the unit basis vectors for this toroidal system. Are these 
vectors orthogonal? Do they form a right-handed system? 


(b) Obtain expressions relating the toroidal coordinates to a set of Cartesian 
coordinates. 


(c) Obtain the h; scale factors for the toroidal system. 


(d) Write expressions for the displacement vector df, a differential surface area 
do, and a differential volume dt in this system. 


(e) Write expressions for the gradient V®, divergence V - A, and curl V x A 
operations in this system. 


(f) Laplace’s equation written in vector notation is V7 = 0. What is Laplace’s 
equation expressed in these toroidal coordinates? 
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11. 


12. 


13. 
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A second toroidal system using coordinates (é, n, ¢) can be formed, with these 
coordinates related to the Cartesian coordinates by: 


_ asinh ncos 
~ cosh y — cos é 
_ asinh 7sin 
cosh y — cosé 
asiné 
cosh 1 — cos 


(a) Describe the surfaces of constant € and constant 7. 

(b) Pick a point and sketch the basis vectors. Is this a right-handed system? 
(c) Obtain the h; scale factors for this toroidal system. 

(d) Express the position and displacement vectors in this system. 


The (u, v, z) coordinates of an elliptical system are related to a set of Cartesian 
coordinates by the equations: 


x = acoshucosv 
y = asinhusinv 
ZS 2Z, 


(a) Sketch the lines of constant u and constant v on a two-dimensional xy-grid. 

(b) Obtain the h; scale factors associated with this elliptical system. 

(c) Obtain the form of a differential path length, as well as the area and volume 
elements used to form line, surface, and volume integrals in this elliptical 
system. 

(d) Obtain expressions for the gradient, divergence, and curl in the elliptical 
system. 

(e) Express the position and displacement vectors in the elliptical system. 


A hyperbolic (u, v, z) coordinate system is sometimes used in electrostatics and 
hydrodynamics. These coordinates are related to the “standard” Cartesian coor- 
dinates by the following equations: 


(a) Sketch the lines of constant u and constant v in the xy-plane. Note that these 
lines become surfaces in three dimensions. 

(b) Indicate the directions of the basis vectors 4, and q, in all four quadrants. 
Is this system orthogonal? 
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14. 


15. 


16. 


17. 


(c) This (u, v,z) system is left-handed. What can be done to make it right- 
handed? 


(d) In the right-handed version of this system, obtain the h; scale factors. 
(e) Express the position and displacement vectors in this system. 


An orthogonal (u, v,z) coordinate system is defined by the following set of 
equations that relate these coordinates to a “standard” Cartesian set: 


x 
MS + y? 
ee See 
x2 + y?2 
Z=2Z. 


(a) Sketch the lines of constant u and constant v in the xy-plane. 

(b) Pick a point in the xy-plane and sketch the q., @v, and 4, basis vectors. It 
is not necessary to solve for them in terms of the Cartesian basis vectors é,, 
é,, and é,. 

(c) Is this (u, v, z) system right-handed? 

(d) Invert the above expressions and solve for x and y in terms of u and v. 

(e) Express the position and displacement vectors in this system. 

(f) Express the gradient operation Vu, v, z) in this system. 

A particle is moving in three-dimensional space. Describe its position F(t), ve- 

locity F, and acceleration r in terms of the the spherical coordinates and basis 


vectors. To work this problem it will be necessary to take time derivatives of the 
basis vectors, because they will change as the particle moves in space. 


A particle is moving in a circular orbit with its position vector given by 
F = ro COS(Wpt) é, + ro sin(wot) é,, 
where r, and w, are constants. If r and F are the first and second derivatives of Ff, 
(a) Sketch the orbit in the xy-plane. 
(b) Evaluate F - F and discuss your result. 
(c) Show that F + wr = 0. 


Consider a scalar function ® of position which depends on the cylindrical coor- 
dinates (p, @, z) such that ® = p cos¢. 
(a) Plot the surfaces of constant ® in the xy-plane. 


(b) Let the vector V = V®, and find the radial V, and azimuthal V, components 
of V. 


18. 


19. 


20. 
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(c) The field lines for V can be described by a function p = p(#). Show that 
p(@) satisfies the equation 


dp(d) _ pV, 


dd Vg 
(d) Solve the above equation for p(@), and plot the V field lines on the same 
graph you plotted the surfaces of constant ®. 


A flat disk rotates about an axis normal to its plane and passing through its center. 
Show that the velocity vector V of any point on the disk satisfies the equations 


V-v=0 
V xv = 20, 
where @ is the angular velocity vector of the disk. This vector is defined as 
rxyv 
r2 


@o= 


Consider a sphere of radius r,, rotating at a constant angular rate w, about its 
Z-axis so that 


@ = wé,. 


(a) Find an expression for the velocity vector V of points on the surface of the 
sphere using spherical coordinates and spherical basis vectors. Remember 
V=@Q@%XYF. 

(b) Perform the integration 


par-v 


around the equator of the sphere. 
(c) Now perform the surface integral 


paaxs 


over the entire surface of the sphere. 
The magnetic field inside an infinitely long solenoid is uniform 
B = B.é,. 


Determine the vector potential A such that V x A = B. For this situation, the 
magnetic field can also be derived from a scalar potential, B = —V®. Why is 
this the case and what is ®? 
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21. 


22. 


The magnetic field inside a straight wire, aligned with the z-axis and carrying 
a uniformly distributed current, can be expressed using cylindrical coordinates 
as 
Bop 
Po 


B= 


és, 


where B, and p, are constants. In this case the magnetic field can still be obtained 
from a vector potential V < A = B. Find this A. Now, however, it is no longer 
possible to find a scalar potential such that B = —V®. Why is this the case? 


A static magnetic field is related to the current density by one of Maxwell’s 
equations, 


VxB= Lod. 
(a) If the magnetic field for p < pz is given, in cylindrical coordinates, by 


Bop” 


B= 
p2 


by, 


how is the current density J distributed? 
(b) If B = 0 for p > po, how much current must be flowing in a cylindrical 
shell at p = po? 


Classically, the angular momentum L is given by 


L=rxp, 


where F is the position vector of an object and P is its linear momentum. 
In quantum mechanics, the value of the angular momentum is obtained by 
letting a vector differential operator £,, act on a wave function Vv: 


Angular Momentum = £,, V. 


The angular momentum operator can be obtained using the correspondence prin- 
ciple of quantum mechanics. This principle says that the operators are obtained 
by replacing classical objects in the classical equations with operators. Therefore, 


Lop Rop x Pop. 
The position operator is just the position vector 
Rop = F. 
The momentum operator is 
Poy = —iAV, 


where i is the square root of —1 and # is Planck’s constant divided by 27. Show 
that, in Cartesian geometry, 
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Fz [9 a), 
Lop = -in[y = ar 25, |e 
te) to] 
“ales -23) 
thio — x5- 18 
-in[xe i yale, 
oy é 


23. The derivation of the curvilinear divergence in this chapter is somewhat sloppy, 
because it treats the scale factors as constants along each surface of the volume. 
Derive the same expression for the divergence followmg the more ngorous 
method demonstrated in Appendix B for the curvilinear curl. 


4 


INTRODUCTION TO TENSORS 


Tensors pervade all the branches of physics. In mechanics, the moment of inertia 
tensor is used to describe the rotation of rigid bodies, and the stress-strain tensor 
describes the deformation of elastic bodies. In electromagnetism, the conductivity 
tensor extends Ohm’s law to handle current flow in nonisotropic media, and Maxwell’s 
stress tensor is the most elegant way to deal with electromagnetic forces. The metric 
tensor of relativistic mechanics describes the strange geometry of space and time. 

This chapter presents an introduction to tensors and their manipulations, first 
strictly in Cartesian coordinates and then in generalized curvilinear coordinates. We 
will limit the material in this chapter to orthonormal coordinate systems. Tensors 
in nonorthonormal systems are discussed later, in Chapter 14. At the end of the 
chapter, we introduce the so-called “pseudo”-objects, which arise when we consider 
transformations between right- and left-handed coordinate systems. 


4.1 THE CONDUCTIVITY TENSOR AND OHM’S LAW 


The need for tensors can easily be demonstrated by considering Ohm’s law. In an 
ideal resistor, Ohm’s law relates the current to the voltage in the linear expression 


(4.1) 
In this equation, / is the current through the resistor, and V is the voltage applied 


across it. Using MKS units, J is measured in amperes, V in volts, and R in ohms. 
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Equation 4.1 describes the current flow through a discrete element. To apply Ohm’s 
law to a distributed medium, such as a crystalline solid, an alternative form of this 
equation is used: 


J = cE. (4.2) 


Here J is the current density, E is the electric field, and o is the material’s conductivity. 
In MKS units, J is measured in amperes per meter squared, E in volts per meter, and 
o in inverse ohm-meters. 

Equation 4.2 describes a very simple physical relationship between the current 
density and the electric field, because the conductivity has been expressed as a scalar. 
With a scalar conductivity, the amount of current flow is governed solely by the 
magnitudes of o and E, while the direction of the flow is always parallel to E. But in 
some materials, this is not always the case. Many crystalline solids allow current to 
flow more easily in one direction than another. These nonisotropic materials must have 
different conductivities in different directions. In addition, these crystals can even 
experience current flow perpendicular to an applied electric field. Clearly Equation 
4.2, with a scalar conductivity, will not handle these situations. 

One solution is to construct an array of conductivity elements and express Ohm’s 
law using matrix notation as 


Ji Ou O12 O13 Ey 
Jo} = | 02 On O23] | Er]. (4.3) 
Jy 031 932 033} | E3 


In Equation 4.3, the current density and electric field vectors are represented by 
column matrices and the conductivity is now a square matrix. This equation can be 
written in more compact matrix notation as 


[J] = [oJ[E] (4.4) 
or in subscript/summation notation as 
Jj = o,;E;. (4.5) 


All these expressions produce the desired result. Any linear relationship between 
J and E can be described. The 1-component of the current density is related to the 
1-component of the electric field via 0),, while the 2-component of the current density 
is related to the 2-component of the electric field through a2). Perpendicular flow is 
described by the off-diagonal elements. For example, the oj element describes flow 
in the |-direction due to an applied field in the 2-direction. 

The matrix representation for nonisotropic conductivity does, however, have a 
fundamental problem. The elements of the matrix obviously must depend on our 
choice of coordinate system. Just as with the components of a vector, if we reorient 
our coordinate system, the specific values in the matrix array must change. The matrix 
array itself, unfortunately, carries no identification of the coordinate system used. The 
way we solved this problem for vector quantities was to incorporate the basis vectors 
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directly into the notation. That same approach can be used to improve the notation for 
the nonisotropic conductivity. We define a new object, called the conductivity tensor, 
which we write as @. This tensor includes both the elements of the conductivity 
matrix array and the basis vectors of the coordinate system in which these elements 
are valid. Because this notation is motivated by the similar notation we have been 
using for vectors, we begin with a quick review. 

Remember that a vector quantity, such as the electric field, can be represented by 
a column matrix: 


E—|E|. (4.6) 
E3 


The vector and matrix quantities are not equal, however, because the matrix cannot 
replace E in a vector equation and vice versa. Instead, the basis vectors of the 
coordinate system in which the vector is expressed must be included to form an 
equivalent expression: 


E = E,é,. (4.7) 


The nonisotropic conductivity tensor @ can be treated in a similar way. It can be 
represented by the matrix array 


os Oi, Giz G13 
O@-— |021 On O21, (4.8) 
03, 9832 G33 


but the matrix array and the tensor are not equivalent because the matrix array contains 
no information about the coordinate system. Following the pattern used for vectors 
and the expression for a vector given in Equation 4.7, we express the conductivity 
tensor as 


o ae oij€e;. (4.9) 


The discussion that follows will show that this is a very powerful notation. It supports 
all the algebraic manipulations that the matrix array notation does and also handles 
the transformations between coordinate systems with ease. 

The expression for the conductivity tensor on the RHS of Equation 4.9 contains the 
elements of the conductivity matrix array and two basis vectors from the coordinate 
system in which the elements are valid. There is no operation between these basis 
vectors. They serve as “bins” into which the o;; elements are placed. There is a 
double sum over the subscripts i and j and so, for a three-dimensional system, there 
will be 9 terms in this sum, each containing two of the basis vectors. In other words, 
we can expand the conductivity as 


C= ) ) G88; = 01168 + 01 28)@, + G2 @.@) + °°. (4.10) 
1. Sg 
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This is analogous to how a vector can be expanded in terms of its basis vectors as 


V = So Vidi = Vids + Vabo + V36s. (4.11) 


Let’s see how this new notation handles Ohm’s law. Using the conductivity tensor, 
we Can write it in coordinate-independent “vector/tensor”’ notation as 


J=c-E. (4.12) 


Notice the dot product between the conductivity tensor and the electric field vector 
on the RHS of this expression. We can write this out in subscript/summation notation 
as 


Is@s = (O jb j€x) * (Er€:). (4.13) 


By convention, the dot product in Equation 4.13 operates between the second basis 
vector of @ and the single basis vector of E. We can manipulate Equation 4.13 as 
follows: 


J,@s = GO jE 8 je, . é (4.14) 
J,é, = Op E1€ j 5x (4.15) 
JIs€, = C jpEZe;. (4.16) 


The quantities on the left- and right-hand sides of Equation 4.16 are vectors. The 
i* components of these vectors can be obtained by dot multiplying both sides of 
Equation 4.16 by é; to give 


Ji = OnE, (4.17) 


which is identical to Equations 4.3-4.5. Keep in mind that there is a difference 
between @ - E and E - G. The order of the terms matters because, in general, 


The basis vectors in this tensor notation serve several functions: 


. They establish bins to separate the tensor components. 
. They couple the components to a coordinate system. 
. They set up the formalism for tensor algebra operations. 


. As shown later in the chapter, they also simplify the formalism for transforma- 
tions between coordinate systems. 


wn = 


Now that we have motivated our investigation of tensors with a specific example, 
we proceed to look at some of their more formal properties. 
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4.2 GENERAL TENSOR NOTATION AND TERMINOLOGY 


The conductivity tensor is a specific example of a tensor that uses two basis vectors 
and whose elements have two subscripts. In general, a tensor can have any number 
of subscripts, but the number of subscripts must always be equal to the number of 
basis vectors. So in general, 


T = Ti jx G8 /& as (4.19) 


The number of basis vectors determines the rank of the tensor. Notice how the tensor 
notation is actually a generalization of the vector notation used in previous chapters. 
Vectors are simply tensors of rank one. Scalars can be considered tensors of rank 
zero. Keep in mind that the rank of the tensor and the dimension of the coordinate 
system are different quantities. The rank of the tensor identifies the number of basis 
vectors on the right-hand side of Equation 4.19, while the dimension of the coordinate 
system determines the number of different values a particular subscript can take. For 
a three-dimensional system, the subscripts (i, j,k, etc.) can each take on the values 
C, 2, 3). 

This notation introduces the possibility of a new operation between vectors called 
the dyadic product. This product is either written as A:B or just A B. The dyadic 
product between vectors creates a second-rank tensor, 


AB = Aj€,B j@; aan A,B j€;é;. (4.20) 


This type of operation can be extended to combine any two tensors of arbitrary rank. 
The result is a tensor with rank equal to the sum of the ranks of the two tensors in 
the product. Sometimes this operation is referred to as an outer product, as opposed 
to the dot product, which is often called an inner product. 


4.3 TRANSFORMATIONS BETWEEN COORDINATE SYSTEMS 


The new tensor notation of Equation 4.19 makes it easy to transform tensors between 
different coordinate systems. In fact, many texts formally define a tensor as “an object 
that transforms as a tensor.” While this appears to be a meaningless statement, as will 
be shown in this section, it is right to the point. 

In this chapter, only transformations between orthonormal systems are consid- 
ered. First, transformations between Cartesian systems are examined and then the 
results are generalized to curvilinear systems. The complications of transformations 
in nonorthonormal systems are deferred until Chapter 14. 


4.3.1 Vector Transformations Between Cartesian Systems 


We begin by looking at vector component transformations between two simple, two- 
dimensional Cartesian systems. A primed system is rotated by an angle @, with 
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Figure 4.1 Rotated Systems 


respect to an unprimed system, as shown in Figure 4.1. A vector V can be expressed 
with unprimed or primed components as 


V = Ve; = Vie. (4.21) 
From the geometry of Figure 4.2, it can be seen that the vector components in the 


primed system are related to the vector components in the unprimed system by the 
set of equations: 


Vi = Cos 0,V; + sin 6,V> 
(4.22) 
V; = — sin 0,V\ + cos 6,V>. 
These equations can be written in matrix notation as 
(V'] = [a]{V], (4.23) 


where [V"] and [V] are column matrices representing the vector V with primed and 
unprimed components, and [a] is the square matrix: 


[a] = | cc a oe a (4.24) 


Figure 4.2. Vector Components 
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4.3.2. The Transformation Matrix 


In general, any linear coordinate transformation of a vector can be written, using 
subscript notation, as 


V! = aijV;, (4.25) 


where [a] is called the transformation matrix. In the discussion that follows, two 
simple approaches for determining the elements of [a] are presented. The first assumes 
two systems with known basis vectors. The second assumes knowledge of only the 
coordinate equations relating the two systems. The choice between methods is simply 
convenience. While Cartesian coordinate systems are assumed in the derivations, 
these methods easily generalize to all coordinate transformations. 


Determining [a] from the Basis Vectors If the basis vectors of both coordinate 
systems are known, it is quite simple to determine the components of [a]. Consider 
a vector V expressed with components in two different Cartesian systems, 


V=V.& = Vié. (4.26) 
Substitute the expression for V/ given in Equation 4.25 into Equation 4.26 to obtain 
V8; = ajjVj€. (4.27) 


This must be true for any V. In particular, let V = 6,,, one of the unprimed basis 
vectors (in other words, Vi¢m = O and Vi=m = 1), to obtain 


@n = Gim®}. (4.28) 
Dot multiplication of é/ on both sides yields 
Anm = (6, * &m)- (4.29) 


Notice the elements of [a] are just the directional cosines between all the pairs of 
basis vectors in the primed and unprimed systems. 


Determining [a] from the Coordinate Equations If the basis vectors are not 
explicitly known, the coordinate equations relating the two systems provide the 
quickest method for determining the transformation matrix. Begin by considering the 
expressions for the displacement vector in the two systems. Because both the primed 
and unprimed systems are Cartesian, 


d¥ = ax,6; = dx/é}, (4.30) 


where the dx/ and dx, are the total differentials of the coordinates. Because Equation 
4.25 holds for the components of any vector, including the displacement vector, 


dx} = ajjdXx;. (4.31) 
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Equation 4.31 provides a general method for obtaining the elements of the matrix 
{a] using the equations relating the primed and unprimed coordinates. Working in 
three dimensions, assume these equations are 

x Figs x41, x2, x3) 


xX} = x4(%1, X2, X3) (4.32) 


i 
X5 = X5(X1,X2,.%3); 
or more compactly, 
x} = x{(X1,X2, X3). (4.33) 


Expanding the total differentials of Equations 4.32 gives 


Ox! (x1, X2,X 
6X1, X2 3) a5 


Ox! (x1, X2, x Ox! (x1, X2,X 
dxi = 11 ¥20%3) py 4 113 %2%3) yy 4 


Ox, OX2 Ox3 
0x5 (x1, X2,X ax5 (x1, X25 X Ox! (x1, X2, Xx 
f= (X41, X2 3) ax, + (x1, X2 3D) ax, a g(%1, X2 3) ax, 
Ox) OXx2 0x3 
Ox4(X1, X25 X Oxh(X1, X2, X Ox4(X1, X2,X 
Fae 3(%1, X2 Dax, 4 9341 x2 3) dey 4 2M %2 3) xs. 


Ox, 0X2 OX3 
Again, using subscript notation, this is written more succinctly as 
to) H ? ts 
dx} = SEC) 5. (4.34) 
Ox j 
Comparison of Equation 4.31 and Equation 4.34 identifies the elements of [a] as 


_ 9X;(%1,X2,X3) 


4.35 
= (4.35) 


aij 


The Orthonormal Property of [a] _ If the original and the primed coordinate systems 
are both orthonormal, a useful relationship exists among the elements of [a]. It is 
easily derived by dot multiplying both sides of Equation 4.28 by &: 


Ae ce AL 
ej = aije; 

é; ° é; = aj;(; 7 €;) (4.36) 
6; = Ajj iz. 


Equation 4.36 written in matrix form is 
[alfa]' = (1), (4.37) 


where [a]' is the standard notation for the transpose of [a], and the matrix [1] is a 
square matrix, with 1’s on the diagonal and 0’s on the off-diagonal elements. 
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The Inverse of [a] The matrix [a] generates primed vector components from 
unprimed components, as indicated in Equation 4.25. This expression can be inverted 
with the inverse of [a], which is written as [a]~' and defined by 


[a][a]~' = [a]~*[a] = [1], (4.38) 
or in subscript notation 
ajj'aje = aijajy = S- (4.39) 
The subscript notation handles the inversion easily: 
V/ = ajjV; 
aj! Vi = ag haijV; 


a;,,'V! = 8 Vj 


anv) = Vy. 


(4.40) 


Transformation matrices which obey the orthonormality condition are simple to 
invert. Comparison of Equation 4.37 and Equation 4.38 shows that 


[aJ-' = [ay’, (4.41) 
or in subscript notation, 
a;;' = aj. (4.42) 
The inverse relation then becomes 


Vi = Aji V;. (4.43) 


Basis Vector Transformations The unprimed basis vectors were related to the 
primed basis vectors by Equation 4.28: 


é; = ajid},. (4.44) 


Using the fact that the inverse of the [a] matrix is its transpose, this expression can 
be inverted to give the primed basis vectors in terms of the unprimed ones 


= aj ;@;. (4.45) 
Remember that these expressions are only valid for transformations if both the primed 


and unprimed systems are orthonormal. 


4.3.3 Coordinate Transformation Summary 


The box below summarizes the transformation equations between two Cartesian 
coordinate systems: 
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~ 


_— - @ = :-@ - 
Vi = aij Vj €; = aij; 


a / As at al 
V; = ajiV; ej = Aji€; 


aij = (8 + 8j) = Ax}(x1, x2, %3)/Ox; 


The functions xj = x/(x1,x2,x3) relate the primed Cartesian coordinates to the 
unprimed Cartesian coordinates. To help keep things straight, notice there is a pattern 
to these transformation equations. Every time we convert from the unprimed system 
to the primed system, whether we are dealing with a basis vector or the components 
of some vector, we sum over the second subscript of a;;. In contrast, conversions 
from the primed system to the unprimed system always sum over the first subscript. 


4.3.4 Tensor Transformations 


To understand why the elements of a tensor must change values when expressed 
in different coordinate systems, consider the conductivity tensor. If in one set of 
coordinates, current flows more easily in the 1-direction than the 2-direction, then 
1, > 2. Observation of the same physical situation in a new, primed coordinate 
system where the 1/-direction is equivalent to the original 2-direction and the 2’- 
direction is the same as the original 1-direction, would show that o{; < 03). Clearly 
the elements of the conductivity tensor must take on different values in the two 
systems, even though they are describing the same physical situation. This is also 
true of a vector quantity; the same velocity vector will have different components in 
different coordinate systems. 

Tensor transformations follow the same pattern as vector transformations. A vector 
expressed in a primed and unprimed system is still the same vector: 


V = Vie = Viel. (4.46) 


Likewise, using the notation of Equation 4.19, the expressions for a second-rank 
tensor in the two systems must obey 


(4.47) 


Herein lies the beauty of this notation. The relationship between the elements, 7;; 
and T,,,, is built into Equation 4.47 and is easily obtained by applying two successive 
dot products to both sides. The first dot product yields 


lalal 
€,€, 


é ; 7; j8€; = é : T,; 
a ne ae lsaA Al\al 
T;;(€ : é)é; = T,,,(@ ‘€, )e; (4.48) 
2 — Tl, al 
Tj 5n€; = T,,an€, 


x. f al 
T,;@; = T,,ar€,. 
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Applying a second dot product in the same manner yields 
Tin = T/.QriAsm- (4.49) 


To invert Equation 4.49 use the inverted matrix [a}~' twice and remember, for 
orthonormal coordinate systems, a;;' = aj. This gives 


Tj, = TrsQirQms- (4.50) 


In general, tensor transformations require one aj; factor for each subscript in 
the tensor. In other words, an r rank tensor needs r different a; ; factors. If the 
transformation goes from the unprimed to the primed system, all the a;;’s are summed 
over their second subscript. For the inverse transformation, going from the primed to 
the unprimed system, the sums are over the first subscript. Tensor transformations, 
for arbitrary rank tensors, can be summarized as follows: 


tf =, 
Tix. _ Tyst... dir @ jsQia 26a 
= f 
Tijk.. = Tyse, Gris jk - +. 


where the elements of the matrix [a} are given by Equation 4.35. 

There is another important feature in the tensor notation of Equation 4.19. Unlike 
in a matrix equation where all terms must be in the same basis, the tensor/vector 
notation allows equations to have mixed bases. Imagine the elements of Ohm’s law 
expressed in both a primed and unprimed coordinate system: 


E = E,é; = Eve! (4.51) 
F = 07 ;6 8; = 0/86, 
Ohm’s law reads 
J=@-E, (4.52) 


and any combination of the representations in Equations 4.5] can be used in the 
evaluation. For example: 


Jié; = (0 ,.8)8) (E/€;) = eee ACH ‘7 é;) = OER ay. (4.53) 

The fact that elements of @ from the primed frame are combined with components of 

E from the unprimed frame presents no problem. The dot product of the basis vectors 

takes care of the mixed representations, as long as the order of the basis vectors is 

preserved. This is accomplished in Equation 4.53 by the fact that @; - €. # 5,. This 

type of an operation could not be performed using a matrix representation without 
explicitly converting everything to the same basis. 

The value of expressing a tensor in the form of 4.19 should now be clear. In addition 

to handling the same algebraic manipulations as a matrix array, it also contains all 
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the information necessary to transform the elements from one coordinate system to 
another. Thus a tensor is truly a coordinate-independent, geometric object, just as a 
vector is. 


4.4 TENSOR DIAGONALIZATION 


In physics and engineering problems we often want to diagonalize a tensor. What 
this means is we want to find a particular coordinate system in which the matrix 
array representation of a tensor has nonzero elements only along its diagonal. A rigid 
body will experience no vibration when rotated around any of the three axes of a 
coordinate system in which the moment of inertia tensor is diagonalized. The process 
of balancing a wheel of an automobile makes use of this fact. Small, asymmetrically 
placed weights are added to the rim until one of these special axes lies along the axle. 

Many students get lost in the mathematical process of diagonalization and forget 
that it is actually a transformation of coordinates. In this section, we derive the 
elements of the transformation matrix [a] that diagonalizes a given tensor. We start off 
with a completely theoretical treatment of the subject. Then two numerical examples, 
one nondegenerate and one degenerate, are worked out in detail. 


4.4.1 Diagonalization and the Eigenvalue Problem 


Based on the discussion of the previous section, a tensor @ written in an unprimed 
system must be equivalent to the same tensor written in a primed system: 


——we Ai Bk Atal 
C= Gj jeje; = O5,€,€,. (4.54) 


We are interested in a very special primed system, a system where all the off-diagonal 
elements of @ are zero. In this case, Equation 4.54 becomes 

Cc = o;j86; = ao, ee. (4.55) 
Both the tensor elements and the basis vectors of the unprimed system are presumed 
to be known. The problem is to find o{,, the elements of the tensor in the primed 
system, and é/, the primed basis vectors, such that Equation 4.55 is satisfied. To do 
this, form the dot product of Equation 4.55 with the first primed basis vector é; as 
follows: 


— pp glial. al 
= Tss€,€, * €y 


1,05.) (4.56) 


‘al 
O7,€)- 


all 
& 


M 


} 


Equation 4.56 reveals an important property of the basis vectors of the system where 
the tensor is diagonal. They do not change direction when dot multiplied by the tensor. 
They can, however, change in magnitude. If we define A; = o{,, Equation 4.56 
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becomes 

8 = Ae. (4.57) 
The Aj factor is called an eigenvalue of @. An eigenvalue results when an operation 
on an object produces a constant, the eigenvalue, times the original object. The primed 


basis vector is called an eigenvector. bs 
Now, we introduce the special unit tensor 1, which is defined as 


i =F 8 j€:6; (4.58) 
so that 
1-V=V. (4.59) 


Represented as a matrix, 1 is simply 


: 10 0 
I—(1]}={0 1 oO}. (4.60) 
001 


Using the I tensor, Equation 4.57 can be rearranged to read 
(@ = di) -@ =0. (4.61) 


Expressing @ in the unprimed system, Equation 4.61 can be written in subscript 
notation as 


(Gi; = A; 8; ee; ‘ é = 0. (4.62) 
Equation 4.29 and some rearrangement yields 
e(oi; = Ai 8: )a1; = 0, (4.63) 


where the a; are three of the unknown elements of the transformation matrix relating 
the original coordinate systems to the one in which @ is diagonal. 

The LHS of Equation 4.63 is a vector, and for it to be zero, each of its components 
must be zero. Each component involves a sum over the index j. Equation 4.63 
therefore becomes three equations which can be written in matrix array notation as 


ay ~ Ay O12 O13 ai 0 
21 O22 — Ay 023 ai2| = ]0]. (4.64) 
3) 032 o33 — Ai a13 0 


In order for a set of linear, homogeneous equations such as those in Equation 4.64 
to have a solution, the determinant of the coefficients must be zero: 
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ou At 12 O13 
02} 022 — Ay 023 = 0. (4.65) 
O31 032 033 ~ Al oer 


This results in a third-order equation for A, which will generate three eigenvalues. 
Select one of these values, it does not matter which one because the other two will be 
used later, and call it A,. Inserting this value into Equation 4.64 allows a solution for 
Q1, @;2, and a;3 to within an arbitrary constant. These are three of the elements of 
the transformation matrix between the primed and unprimed systems that we seek. 
These three elements also allow the determination of the é; basis vector to within an 
arbitrary constant: 


@) = ay ;8;. (4.66) 


Requiring €; to be a unit vector determines the arbitrary constant associated with aj, 
a2 and ay3: 


(ai)? + (ay)? + (a3 = 1. (4.67) 


Except for an overall arbitrary sign and the degenerate situation discussed below, we 
have now uniquely determined 6}. 

The other primed basis vectors and elements of the transformation matrix are 
obtained in a similar way. The second primed basis vector is determined by forming 
the dot product in Equation 4.56 using @,. A matrix equation equivalent to 4.64 
is written with Az, a2), a2, and a3. The resulting determinant equation for Az is 
identical to the one for A,, in Equation 4.65. One of the two remaining eigenvalues 
of this equation is selected for Az and used to determine a);, 422, 423, and 6}. Ina 
similar way, the last eigenvalue of Equation 4.65 is used for A3 to determine a3), 432, 
a33, and &. 

The primed coordinate system, in which @ is diagonal, is defined by the basis 
vectors é}, €, and é5. The elements of @ in this primed system are just the eigenvalues 
determined from Equation 4.65, 


A, O 0 
{oJ=}0 A Of. (4.68) 
0 O A; 


The matrices of interest in physics and engineering are typically Hermitian. If 
we allow the possibility for complex matrix elements, a matrix is Hermitian if it is 
equal to its complex conjugate transpose. That is, 7;; = o j;. There are two important 
properties of Hermitian matrices. First, their eigenvalues are always pure real num- 
bers. Second, their eigenvectors are always orthogonal. Proofs of these statements 
are left as exercises at the end of this chapter. 

The only complication that can arise in the above diagonalization process is a 
degenerate situation that occurs when two or more of the eigenvalues are identical. 
Consider the case when A; # Az = A3. The unique eigenvalue A, determines aj), 
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42, a3 and the eigenvector @}, just as before. The degenerate eigenvalues, however, 
will not uniquely specify their eigenvectors. These eigenvectors may be chosen an 
infinite number of ways. An example with this type of degeneracy is discussed in one 
of the examples that follows. 


Example 4.1 As an example of the diagonalization process, consider the conduc- 
tivity tensor expressed in Cartesian coordinates: 


C= G86 ;. (4.69) 


Let this tensor have the matrix representation (ignoring units) 


10 0 0 
[o]=]0 10 1]. (4.70) 
0 1 10 


This matrix is Hermitian, so we can expect to find pure real eigenvalues and orthog- 
onal eigenvectors. The eigenvalues for the diagonalization are generated from the 
determinant equation 


10-A ] = 0. (4.71) 


Expansion of the determinant gives the third-order polynomial equation 
(10 ~ A) [0 - ay’ — 1] = 0, (4.72) 
which has three distinct roots: A; = 9, A2 = 11, and A3 = 10. 


The elements of a, ; are determined by inserting the value of A; into Equation 4.64 
to obtain 


1 0 0 ayy 0 
01 1 an! = };0]. (4.73) 
01 1 aj3 0 

This equation requires aj. = ~a,3 and a1, = 0. The normalization condition imposes 


the additional constraint that (a,2)* + (a;3)* = 1 and results in 


ay] 
1 
ag) =] 1). (4.74) 


413 v2 | -1 


The first eigenvector associated with the primed system becomes 


et = (1/V/2)8 — (1/V/2)8s. (4.75) 
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The other components of [a] can be determined by performing similar calculations 
to give Az and A3. The complete transformation matrix is 


i, 0 Amd 
[a] = — ae (4.76) 


0 
v2 f/2 0 0 
The other two primed eigenvectors are 


@ = (1/V/2)& + (1/V2)e3 (4.77) 


and 


é, = @. (4.78) 

It should be noted that there is an ordering ambiguity associated with the eigenvalues, 

as Well as a sign ambiguity associated with each eigenvectors. These ambiguities allow 

us to always set up the primed system to be right-handed. The ordering and sign 

choices made in this example give the primed basis vectors shown in Figure 4.3. 
The conductivity tensor elements expressed in the new, diagonal system are 


9 0 0 
{[oJ=j0 11 Of. (4.79) 
0 0 10 


Example 4.2. This example demonstrates the diagonalization process when two of 
the eigenvalues are degenerate. Again consider a conductivity tensor with Cartesian 
elements (ignoring units) 


11 -1 0 
[oJ=]-1 11 Of. (4.80) 
0 oOo 10 
1 


Figure 4.3 The Primed System Basis Vectors 
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This is a Hermitian matrix and so we expect pure real eigenvalues and orthogonal 
eigenvectors. The determinant condition is 
l1-A =] 0 
—-t iW-aA 0 = 0, (4.81) 
0 0 10—A 
which leads to the third-order polynomial equation 


(10— A)[(G1-Ay’— 1]. (4.82) 


This third-order equation has three roots, but only two are distinct, A; = 12 and 
Aa = d3 = 10. The A, root can be treated as before. When substituted into Equation 
4.64, the matrix relation becomes 


-1l -i 0 ayy 0 
—~1 -1 Of lan} = fol. (4.83) 
0) 0 =) 413 0 


a2] =—=|-1]. (4.84) 


ay3 V2 0 
These elements of the transformation matrix define the first eigenvector 
ey = (1/298 — C/V2)e. (4.85) 


Now consider the degenerate eigenvalue. When Az = 10 is substituted into Equa- 
tion 4.64 we obtain 


1 -1 0 a2} 0 
~1 1 Of lan] = jo}. (4.86) 
0 0 0 a3 0 


Substitution of A3 gives almost the same equation: 


1 -1 0 a31 0 
== 1 0O ayi' = |0 (4.87) 
0 0 0 a33 0 


Equation 4.86 requires a), = a2, but gives no constraint on a3. The normalization 
condition forces the condition a3, + a3, + a3, = 1. These conditions can be satisfied 
by many different eigenvectors. Since a3 is arbitrary, set it equal to zero. Now if the 
second eigenvector is to be orthogonal to é. we have 


1 
ial le (4.88) 
0 
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This gives for the second eigenvector 


@ = (1/v/2)8 + 1/2). (4.89) 


The eigenvector associated with A3 is given by Equation 4.87 and has all the same 
constraints as the Az eigenvector, namely, a3; = 432 and a3; is arbitrary. If we want 
the eigenvectors to be orthogonal, however, @, must be perpendicular to @| and é}. 
The basis vectors é} and é4 are both in the original 12-plane and so, if €{ is to be 
perpendicular to these two primed basis vectors, it must be along the 3-direction. 
This gives 


a3) 0 
ayn| = 10], (4.90) 
a33 1 
and for the third eigenvector 
&, = 6. (4.91) 


A quick check will show that these three eigenvectors are orthonormal and define a 
right-handed coordinate system in which the elements of the conductivity tensor are 
diagonalized. 


4.55 TENSOR TRANSFORMATIONS IN CURVILINEAR 
COORDINATE SYSTEMS 


The transformations of the previous sections easily generalize to curvilinear coordi- 
nate systems. First consider the intermediate problem of a transformation between a 
Cartesian and a curvilinear system. 

Let the Cartesian system have primed coordinates (x/, x}, x4) and basis vectors 
(é}, @5, 64), while the unprimed, curvilinear system has coordinates (q), 42, q3), basis 
vectors (G1, G2, G3), and scale factors (h,,h2,h3). The set of equations relating the 
coordinates of the two systems can be written as 


X{ = *1(41, 92,93) 
x4 = X4(41» G2, 93) (4.92) 


II 


x3 = X3(41, 92, 93)- 
For example, the standard cylindrical system would use the equations 


x' = pcos@ 


= y’ = psin@ (4.93) 


a 
tl 


Ni 
I 


= 
30% 
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The transformation matrix [a] performs the same function as before. That is, it takes 
the unprimed, curvilinear components of a vector and generates the primed, Cartesian 
components: 


V; = aijV;. (4.94) 


Recall from the previous chapter that the displacement vector for the two systems 
can be written 


d¥ = dx/é! = hjdq ;qj. (4.95) 


The components of the displacement vector in the unprimed curvilinear system are 
given by the hjdqj;, while its components in the primed Cartesian system are given 
by the dx/. These components must be related by the transformation matrix [a]. In 
subscript notation 


dx} = ajjhjdqj. (4.96) 
The total differential dx/ can be formed from Equations 4.92 and becomes 


Oxi(qi, 92, 43) 
= AiG F293) 7 (4.97) 
aq; qj 


dx} 
Equation 4.97 can be placed in the form of Equation 4.96 by multiplying the RHS of 
4.97 by hj/h;: 


_ 9X41 92593) Aj Oxi(91s 92» 93) 


dx! —t = 
04; ie h;aq; 


L 


hjdqj;. (4.98) 


Comparing Equations 4.98 and 4.96 gives 


Ox}(41, 42 fee : 
aij = O%;(G1 G2» 43) (Curvilinear — Cartesian]. (4.99) 
hj0q; 
The further generalization for the transformation between two curvilinear systems 
follows in a straightforward way. The elements for the transformation matrix [a] in 
this case become 


bin i hidg!(41, 92, 43) 
‘ hjoq; 


(Curvilinear — Curvilinear]. (4.100) 


Note there is no sum over i or j on the RHS of Equation 4.100 because both these 
subscripts appear on the LHS of the expression. 

Equation 4.100 is the most general form for the elements of the transformation 
matrix between two curvilinear coordinate systems. It simplifies to Equation 4.99 if 
the primed system is Cartesian since the hk! — 1. It further degenerates to Equation 
4.35 if both the primed and unprimed systems are Cartesian since the h/ and the 
h pone 1. 
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As before, the transformation matrix can also be determined from the basis vectors 
of the two coordinate systems. For the general curvilinear case, the elements of [a] 
are 


aij = (@- Gj). (4.101) 


The above manipulations are fast and easy using subscript notation. It might be a 
useful exercise to go through the same steps using just matrices to convince yourself 
the subscript notation is more efficient. 


4.6 PSEUDO-OBJECTS 


If we consider only transformations that involve rigid rotations or translations, there 
is no way to change a right-handed system into a left-handed system, or vice-versa. 
To change handedness requires a reflection. Transformations that involve reflections 
require the introduction of the so-called “pseudo”-objects. Pseudoscalars, pseudo- 
vectors, and pseudotensors are very similar to their “regular” counterparts, except 
for their behavior when reflected. An easy way to demonstrate the distinction is 
by closely examining the cross product of two regular vectors in right-handed and 
left-handed systems. Another way that emphasizes the reflection properties of the 
transformation is the mirror test, which is presented in Appendix E. 


4.6.1 Pseudovectors 


Consider the right-handed Cartesian coordinate system shown in Figure 4.4. Picture 
two regular vectors in this system, oriented along the first two basis vectors: 


A= Ag (4.102) 
B = Bg. (4.103) 


By “regular,” we mean that the components of these vectors obey Equation 4.25 when 
we transform the coordinates. 
The cross product between A and B can be formed using the determinant, 


fe & 

AXB=([A, 0 O| = A,Bobs, (4.104) 
0 Bo Ol., 
3 


Figure 4.4 The Right-Handed System 
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Figure 4.5 Vectors in the Right-Handed System 


or, equivalently, using the Levi-Civita symbol: 
A X B = A,B je; = AoBoes. (4.105) 


The resulting vector is shown in Figure 4.5. Notice how the direction of A XB is given 
by the standard right-hand rule. If you point the fingers of your hand in the direction 
of A and then curl them to point along B, your thumb will point in the direction of the 
cross product. Keep in mind that the cross product is not commutative. If the order 
of the operation is reversed, that is if you form B X A, the result points in exactly the 
opposite direction. 

Now consider the left-handed system shown in Figure 4.6, with coordinates and 
basis vectors marked with primes to distinguish them from the coordinates and basis 
vectors of the right-handed system. This system results from a simple inversion of 
the 1-axis of the unprimed system. It can also be looked at as a reflection of the right- 
handed system about its x.x3-plane. The equations relating the primed and unprimed 
coordinates are 


x, = 7X) 
x5 = +x (4.106) 
x4 = +3 

3 


Figure 4.6 The Left-Handed System 


88 INTRODUCTION TO TENSORS 


so that the transformation matrix becomes 


-1 0 0 
faqj=}] 0 1 OF. (4.107) 
0 0 1 


The regular vectors A and B in the primed coordinate system are simply 
A = —A, & (4.108) 
B = B, é}. (4.109) 


We just wrote these results down because they were obvious. Remember, formally, 
they can be obtained by applying [a] to the components of the unprimed vectors. The 
matrix multiplication gives 


Aj -1 0 0 Ag —Ao 
Aj} =]|0 1 0 O|= 0 (4.110) 
Aj 0 01 0 0 

and 
Bi -1 0 0 0 0 
By} ={0 1 0 Bo| = | Bol. (4.111) 
Bi 0 O01 0 0 


It is important to remember that the vectors are the same physical objects in both 
coordinate systems. They are just being expressed in terms of different components 
and different basis vectors. 

Now form the cross product of A and B in the left-handed system. To do this we 
will use the same determinant relation: 


— _ | & & & 
AXB=!/-A, 0 O| =~—A,B, &, (4.112) 
0 Bo 0 det 
or in terms of the Levi-Civita symbol: 
A X B= Aj BY & ex = Aj By €123 = — Ay Bo 8}. (4.113) 


The vectors A and B and the cross product A X B are shown in Figure 4.7 for the 
left-handed coordinate system. Notice now, the right-hand rule used earlier no longer 
works to find the direction of the cross product. If we define the cross product using 
the determinant in Equation 4.112, then we must use a left-hand rule if we are ina 
left-handed coordinate system. 

There is something peculiar here. Compare Figures 4.7 and 4.5 notice that while 
A and B point in the same directions in the two systems, their cross product does not! 
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Figure 4.7 Vectors in The Left Handed System 


By changing the handedness of the coordinate system, we have managed to change 
the vector A x B. 

Let’s look at this cross product from another point of view. If the unprimed 
components of the quantity A < B, given in Equation 4.104, are transformed into the 
primed system using the [a] matrix, as one would do for regular vector components, 
we obtain 


-1 0 0 0 0 
0 10 o |=| 0 |. (4.114) 
0 0 1 AoBo AoBo 


Combining these components with the appropriate basis vectors gives for the cross 
product vector 


Ao Bo &}. (4.115) 


This result disagrees with Equation 4.112 by a minus sign. To get around this difficulty, 
a quantity formed by the cross product of two regular vectors is called a pseudovector. 
Pseudovectors are also commonly called axial vectors, while regular vectors are 
called polar vectors. If V is a regular vector it transforms according to Equation 4.25. 
However, if V is a pseudovector, its components transform according to 


Vi = alge Viari- (4.116) 


In this way Equation 4.114 becomes 


(A X B)! -1 0 0 0 0 
(AXBy|=-|0 1 0 0 7 0 (4.117) 
(A X B)h 0 O 1] |A,Bo —A,B, 
giving 
A X B= —A, B, 6, (4.118) 


in agreement with Equations 4.112 and 4.113. 
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To summarize, if V is a regular vector its components transform as 
Vi = V; aj. (4.119) 
If instead it is a pseudovector, it components transform as 
Ve = lalaer Viari- (4.120) 


If the handedness of two orthonormal coordinate systems is the same, a transformation 
between them will have |a|,., = 1 and vectors and pseudovectors will both transform 
normally. If the systems have opposite handedness, |a|4., = ~1 and vectors will 
transform normally but pseudovectors will flip direction. A vector generated by a 
cross product of two regular vectors is actually a pseudovector. 

It is tempting to think that all this balderdash is somehow a subtle sign error 
embedded in the definition of the cross product. In some cases, this is correct. For 
example, when we define the direction of the magnetic field vector, which turns out 
to be a pseudovector, we have implicitly made an arbitrary choice of handedness 
that must be treated consistently. Another example is the angular momentum vector, 
which is defined using a cross product. While you could argue that the “pseudoness” 
of these two examples is just a problem with their definition, there are cases where 
you cannot simply explain this property away. It is possible to design situations where 
an experiment and its mirror image do not produce results which are simply the mirror 
images of each other. In fact, the Nobel Prize was won by Lee and Yang for analyzing 
these counterintuitive violations of parity conservation. The classic experiment was 
first performed by Wu, who showed this effect with the emission of beta particles 
from Cobalt-60, under the influence of the weak interaction. 


4.6.2 Pseudoscalars 


The ideas that led us to the concept of pseudovectors apply to scalars as well. A proper 
scalar is invariant to any change of the coordinate system. In contrast, a pseudoscalar 
changes sign if the handedness of the coordinate system changes. A pseudoscalar 
involved in a transformation, governed by the transformation matrix [a], will obey 


S’ = alge S. (4.121) 


A good example of a pseudoscalar derives from the behavior of the cross product 
operation. The volume of a three-dimensional parallelogram, shown in Figure 4.8, 
can be written as 


Volume = (A X B)- C. (4.122) 


In a right-handed system, the vector formed by A X B will point in the upward 
direction. So in a right-handed system, 


(A X B)-C>0. (4.123) 
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Figure 4.8 The Parallelogram 


In a left-handed system, A X B points downward, and consequently 
(A X B)-€ <0. (4.124) 


Interpreted in this way, the volume of a parallelogram is a pseudoscalar. 


4.6.3 Pseudotensors 


Pseudotensors are defined just as you would expect. Upon transformation, the com- 
ponents of a pseudotensor obey 


Tha. = NAldet Tijk... ri sj Atk ++» (4.125) 


which is exactly the same as a regular tensor, except for the |a|,., term. 

Again we turn to the cross product to find a good example. Consider two coordinate 
systems. One, the unprimed system, is a right-handed system, and the other, with 
primed coordinates, is left-handed. Using the Levi-Civita symbol in both coordinate 
systems to generate the cross product of A and B gives the relation 

A(B jey €ijx = —A, Biel... (4.126) 
The minus sign occurs because, as we showed earlier, the physical direction of the 
cross product is different in the two coordinate systems. Now the transformation 
properties of regular vectors can be used to find the relationship between e; , and €;,,. 
Because A, B, and the basis vectors are all regular vectors, they transform according 
to Equation 4.25. Writing the primed components of these vectors in terms of the 
unprimed, Equation 4.126 becomes 
AjB jek €ijk a ~ A;B jbx aids jax, (4. 1 27) 


rst* 
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This expression is true for arbitrary A and B, so we obtain the result 
Eijk =~ Apis JAKE; p- (4.128) 


Keep in mind this applies only when the two systems have opposite handedness. 
If both systems have the same handedness, the minus sign disappears. Thus for 
the general case of arbitrary transformation between two orthonormal systems, the 
Levi-Civita symbol components obey 


€ije = lalderQriGs Ak Els, (4.129) 


Consequently, the Levi-Civita symbol is a pseudotensor. 


EXERCISES FOR CHAPTER 4 


1. Determine the transformation matrix [a] that corresponds to a rotation of a two- 
dimensional Cartesian system by 30°. Obtain the inverse of this matrix [a]~! and 
evaluate the following operations: 

=I 
(a) Qj; aim . 
(b) a;;'Gm,j. 
(c) j;'Om ye 

2. Consider the transformation from a standard two-dimensional Cartesian system 
(x, x2) to a primed system (x;, x5) that results from a reflection about the x-axis, 
as shown below: 


(a) Express the coordinates of a point in the primed system in terms of the 
coordinates of the same point in the unprimed system. 
(b) Determine the elements of the transformation matrix [a] that takes vector 
components from the unprimed system to components in the primed system. 
(c) Determine [a]! in three ways: 
i. By inverting the [a] matrix found in part (b). 
ii. By inverting the coordinate equations of part (a). 
iii. By simply switching the primed and unprimed labels on the coordinates. 
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3. 


wa 


Determine the elements of the transformation matrix [a] that generates the Carte- 
sian components of a vector from its components in the toroidal system defined 
in Exercise 10 of Chapter 3. 


. The coordinates of a hyperbolic system (u, v, z) are related to a set of Cartesian 


coordinates (x, y, z) by the equations 


u=x'-y? 


v = 2xy 
dia 


Determine the elements of the transformation matrix [a] that takes the Cartesian 
components of a vector to the hyperbolic components. Using this transformation 
matrix and the position vector expressed in the Cartesian system, express the 
position vector in the hyperbolic system. 


Consider two curvilinear systems, an unprimed cylindrical system and a primed 

spherical system. 

(a) What are the equations relating the cylindrical coordinates to the spherical 
coordinates? 

(b) Find the elements of the transformation matrix [a] that generates the primed 
vector components from unprimed components. 


Consider a two-dimensional, primed Cartesian system that is shifted an amount x, 
and rotated an amount 6, with respect to a two-dimensional unprimed Cartesian 
system, as shown in the figure below. 


, 


es 


(a) Identify the equations that generate the xy-coordinates from the x’y’- 
coordinates. 

(b) Determine the elements of the transformation matrix [a] that generate the 
primed components of a vector from the unprimed components. 


. Consider a two-dimensional Cartesian xy-system and a shifted polar p’6'-system, 


as shown in the figure below. The origin of the shifted polar system is located at 
X=1Pro,y = 0. 


94 
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(a) In a sketch, pick a point P well off the x-axis and draw the Cartesian and 
shifted polar basis vectors. 

(b) Express the Cartesian coordinates in terms of the shifted polar coordinates. 
Invert these equations and express the shifted polar coordinates in terms of 
the Cartesian coordinates. 

(c) Find the elements of the transformation matrix [a] that relates the shifted 
polar basis vectors to the Cartesian basis vectors. 

(d) Express the displacement vector dF first in the Cartesian system and then in 
the shifted polar system. 


A two-dimensional (u, v) elliptical coordinate system can be related to an (x, y) 
Cartesian system by the coordinate equations 


x = coshucos v 


y = sinhusin v. 


Determine the elements of the transformation matrix [a] that converts vector 
components in the Cartesian system to vector components in this elliptical sys- 
tem. What is [a]~!? 


Consider three coordinate systems: (1) a Cartesian system with coordinates 
(x1, x2, x3) and basis vectors (€), €), 63); (2) a curvilinear system with coordinates 
(41, 92,43) and basis vectors (@), 42, G3); and (3) a second curvilinear system 
with coordinates (q}, q3,q4) and basis vectors (@}, @5, @4). Let the relationships 
between the Cartesian coordinates and these curvilinear coordinates be given by 


X= X(41.92593) X= X'(4}, 44,45) 
y = y(q1.92.93) = y = y'(4i. 95,95) (4.130) 
z= 291,92,93) 2 = 2'(4)5 93,95). 
(a) Find the general expressions for the elements of the transformation matrix 
[a] that takes vector components from one curvilinear system to the other. 
(b) Take one of the curvilinear systems to be the cylindrical system, and the 


other to be the elliptical system of Exercise 8 and specifically determine the 
elements of [a] and its inverse. 
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10. 


11. 


Consider a two-dimensional Cartesian system with coordinates (x > x), y > x2) 

and basis vectors (€, — 6€),é, — @)) and a polar system with coordinates 

(p — qi, @ — qz) and basis vectors (€, — 4i,éy — @). 

(a) Express the Cartesian coordinates (x), x2) in terms of the polar coordinates 
(91, 92). 


(b) Express the position vector F, first using the Cartesian basis vectors and then 
using the polar basis vectors. 

(c) Determine the elements of the transformation matrix [a] that takes vector 
components from the polar system to vector components in the Cartesian 
system. Show that this transformation matrix works for the expressions for 
the position vector of part (b) above. 


(d) What are the elements of [a]~!, the matrix that generates polar components 
from the Cartesian components? 


(e) Determine the elements of the displacement vector dF in the Cartesian and 
polar systems. 


(f) A second-rank tensor expressed in the Cartesian system takes the following 
form 


What are its elements in the polar system? 


Consider the transformation from a two-dimensional Cartesian 12-system to a 
Cartesian 1'2’-system, which includes both an inversion and a rotation as shown 
in the drawing below. 


(a) Express the vector V = 3é, + 2é) in the 1/2’-system. 


(b) Express T=i ‘6,6; in the 1'2'-system. Do not forget to sum over the repeated 
i and j subscripts. 
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12. Consider a two-dimensional conductivity tensor @ whose elements expressed in 
an (x, y) Cartesian system have the matrix representation 


e—|} ° 
O 2|° 


The current density J is related to the electric field E through Ohm’s law: 


J=G-E. 


(a) If the electric field is given by E = E,é, + E,€y, express the current density 
in this Cartesian system. 

(b) Now transform the components of & to a new orthogonal (x', y’) system, 
where the basis vector é/. is parallel to the E given in part (a) above. The 
electric field in this new system is given as E = V2Ee. What are the 
elements of the transformation matrix that accomplishes this? What are the 
components of @ in this new system? 

(c) With @ and E expressed in the primed system, form the product @ - E in that 
system and show that it results in the same vector obtained in part (a) above. 


13. Show, using matrix array and subscript/summation notation, that in general 
G-E#E-G. 


14, Consider the inner product of two second-rank tensors, each expressed in the 


same orthonormal system: 


e 


A-B. 
In subscript/summation notation this product involves the term 
&é; : Em€n. 


(a) Develop an expression for A-B using the Kronecker-6 and show that the 
result is also a second-rank tensor. 


(b) Show that A - B does not equal B - A. 


15. Let T bea second-rank tensor and V be a vector. Expressed in a Cartesian system, 


(a) Express the following using subscript/summation notation and, where nec- 
essary, the Levi-Civita symbol 
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16. 


17. 


18. 


i. V-T. 
ii TV. 
iii. (V-T)xV 


iv. (V-T)X (T-V). 
v. (V-T)-(T-V). 
(b) What values for the elements of T, other than 7;; = 0, will result in Vx 
(V - T) = 0 for any V? 
(c) What values for the elements of T, other than 7; j = 0, will result in V-(V- 
T) = O for any V? 


Use subscript/summation notation to show that the expressions 
(F-C)xD and F. (exp) 


are not necessarily equal. 


An orthogonal (u, v,z) coordinate system is defined by the set of equations 
relating its coordinates to a standard set of Cartesian coordinates, 


_ x 
tO ty 
pease 

x2 + y? 
z= %, 


(a) Determine the elements of the transformation matrix [a]. 

(b) At the Cartesian point (1, 1, 1) let the vector V = 1é, +2 é, + 3 é,. Express 
V at this point using (u, v, z) system vector components and basis vectors. 

(c) At the Cartesian point (1, 1, 1) let the tensor T = 16,6, + 2@,é@, + 3 @,é, + 
4@,é,. Express T at this point using (u, v,z) system tensor elements and 
basis vectors. 


Consider a dumbbell positioned in the xy-plane of a Cartesian system, as shown 
in the figure below. The moment of inertia tensor for this object expressed in this 
Cartesian system is 


I = 1,8; 
7 2 -2 0 
I—-M,|-2 2 0 
0 0 4 


Find the basis vectors of a coordinate system in which the moment of inertia 
tensor is diagonalized. Draw the dumbbell in that system. 
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19. Let a tensor T = T;;€;€; in a two-dimensional Cartesian system be represented 
by the matrix array 
= 3 1 
T- 3 | . 


(a) Find the eigenvalues and eigenvectors of this matrix array. 

(b) Plot the eigenvectors and show their relation to the Cartesian basis vectors. 
(c) Are the eigenvectors orthogonal? Is the matrix array Hermitian? 

(d) Repeat parts (a)—(c) for the tensor 


= 1 1 
To E 1 : 
20. Let the components of the conductivity tensor in a Cartesian system be repre- 
sented by 
3 -1 0 
@—-[oJ={}-1 3 0O 
0 Oo 1 


Identify an electric field vector, by specifying its components, that will result in 
a current density that is parallel to this electric field. 


21. Using subscript/summation notation, show that the eigenvalues of a Hermitian 
matrix are pure real. Show that the eigenvectors generated by the diagonalization 
of a Hermitian matrix generate orthogonal eigenvectors. 

Start with a second-rank tensor that is assumed to have off-diagonal elements 
that may be complex, 
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22. 


23. 


Let the eigenvalues of the matrix representation of this tensor be A; and the 
eigenvectors associated with these eigenvalues be &/ = a; j&; So that 

T; {En J = Anni 
Notice that although the n subscript is repeated on the RHS it is not summed over 
because it appears on the LHS, where it is clearly not summed over. Now form 
the quantities a7,,7';;,a,,; and a,,T ;;a,,; and subtract one from the other. Apply the 
Hermitian condition and see what it implies. 


Is the dot product between two pseudovectors a scalar or a pseudoscalar? Give 
an example from classica] mechanics of such an operation. 


Show that the elements of the Kronecker delta 5,; transform like elements of a 
tensor, not a pseudotensor. 


5 


THE DIRAC 6-FUNCTION 


The Dirac 6-function is a strange, but useful function which has many applications in 
science, engineering, and mathematics. The 6-function was proposed in 1930 by Paul 
Dirac in the development of the mathematical formalism of quantum mechanics. He 
required a function which was zero everywhere, except at a single point, where it was 
discontinuous and behaved like an infinitely high, infinitely narrow spike of unit area. 
Mathematicians were quick to point out that, strictly speaking, there is no function 
which has these properties. But Dirac supposed there was, and proceeded to use it 
so successfully that a new branch of mathematics was developed in order to justify 
its use. This area of mathematics is called the theory of generalized functions and 
develops, in complete detail, the foundation for the Dirac 6-function. This rigorous 
treatment is necessary to justify the use of these discontinuous functions, but for the 
physicist the simpler physical interpretations are just as important. We will take both 
approaches in this chapter. 


5.1 EXAMPLES OF SINGULAR FUNCTIONS IN PHYSICS 


Physical situations are usually described using equations and operations on contin- 
uous functions. Sometimes, however, it is useful to consider discontinuous ideal- 
izations, such as the mass density of a point mass, or the force of an infinitely fast 
mechanical impulse. The functions that describe these ideas are obviously extremely 
discontinuous, because they and all their derivatives must diverge. For this reason 
they are often called singular functions. The Dirac 6-function was developed to de- 
scribe functions that involve these types of discontinuities and provide a method for 
handling them in equations which normally involve only continuous functions. 
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‘Force | Momentum 
! 


Area =-A(m,V) 


time time 


ty t t “th 
(a) (b) 


Figure 5.1 Force and Change in Momentum 


5.1.1 The Ideal Impulse 


Often a student’s first encounter with the 5-function is the “ideal” impulse. In me- 
chanics, an impulse is a force which acts on an object over a finite period of time. 
Consider the realistic force depicted in Figure 5.1(a). It is zero until tf = t,, when it 
increases smoothly from zero to its peak value, and then finally returns back to zero 
at f = t). When this force is applied to an object of mass m,, the momentum in the 
direction of the applied force changes, as shown in Figure 5.1(b). The momentum 
remains constant until t = t;, when it begins to change continuously until reaching 
its final value at ¢ = t). The net momentum change A(m, v) is equal to the integrated 
area of the force curve: 


+o h 
dt F(t) = / dt F(t) 
als h 


h du 
= dt m>— = mov 
i ° at oar 


2 


= A(m,v). (5.1) 


— Mov 


An ideal impulse produces all of its momentum change instantaneously, at the 
single point t = t,, as shown in Figure 5.2(a). Of course this is not very realistic, 
since it requires an infinite force to change the momentum of a finite mass in zero 
time. But it is an acceptable thought experiment, because we might be considering 
the limit in which a physical process occurs faster than any measurement can detect. 


| Momentum Force 


& 
3 
° 

Ss 


Le Area = A(m,Vv) 


l time io ae 


(a) (b) 


Figure 5.2 An Instantaneous Change in Momentum 
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The force of an ideal impulse cannot be graphed as a function of time in the normal 
sense. The force exists for only a single instant, and so is zero everywhere, except at 
t = f,, when it is infinite. But this is not just any infinity. Since the total momentum 
change must be A(z, v), the force must diverge so that its integral area obeys (for all 
t.<t4) 


i‘ _ J AGmov) <i, <ty 
/ oO ‘c otherwise ~ (5.2) 


In other words, any integral which includes the point t, gives a momentum change 
of A(m,v). On the other hand, integrals which exclude ¢, must give no momentum 
change. We graph this symbolically as shown in Figure 5.2(b). A spike of zero width 
with an arrow indicates the function goes to infinity, while the area of the impulse is 
usually indicated by a comment on the graph, as shown in the figure, or sometimes 
by the height of the arrow. 

The Dirac 5-function 5(t) was designed to represent exactly this kind of “patho- 
logical” function. 5(t) is zero everywhere, except at t = 0, when it is infinite. Again, 
this is not just any infinity. It diverges such that any integral area which includes 
t = Ohas the value of 1. That is (for all ¢_ < ¢,), 


[#s={6 fe at, (5.3) 
t. 


otherwise 


The symbolic plot is shown in Figure 5.3. The ideal impulse, discussed above, can 
be expressed in terms of a shifted Dirac 5-function: 


F(t) = A(m,v) &(t — to). (5.4) 


The t — t, argument simply translates the spike of the 5-function so that it occurs at 
t, instead of 0. 


5.1.2 Point Masses and Point Charges 


Physical equations often involve the mass per unit volume p,,(r) of a region of space. 
Normally p,,(F) is a continuous function of position, but with 6-functions, it can also 
represent point masses. A point mass has a finite amount of mass stuffed inside a 
single point of space, so the density must be infinite at that point and zero everywhere 
else. 


d(t) 


: Area = 1 


t 


Figure 5.3 The Dirac 6-function, 5(t) 
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Figure 5.4 A Single Point Mass at the Origin 


Integrating the mass density over a volume V gives the total mass enclosed: 
/ dT Pm(T) = total mass inside V. (5.5) 
Vv 


Thus, if there is a single point of mass m, located at the origin, as shown in Figure 5.4, 
any volume integral which includes the origin must give the total mass as m,. Integrals 
which exclude the origin must give zero. In mathematical terms: 


=) _ J Mo origin included in V 
[a Pus(F) { 0 origin excluded from V ” oo) 


Using Dirac 5-functions, this mass density function becomes 
Pm(F) = mo8(x)8(y)8(2). (5.7) 
Equation 5.6 can easily be checked by expanding the integral as 
‘i: aT p(t) = [a fo fe mM, 5(x)5(y)8(z). (5.8) 
Application of Equation 5.3 on the three independent integrals gives m, only when 


V includes the origin. If, instead of the origin, the point mass is located at the point 
(Xo. Yo Zo), Shifted arguments are used in each of the 5-functions: 


Pm) = mo d(x Xo)B(Y — Yo)S(Z — 2). (5.9) 


Dirac 5-functions can also be used, in a similar way, to represent point charges in 
electromagnetism. 


5.2 TWO DEFINITIONS OF 6() 


There are two common ways to define the Dirac 5-function. The more rigorous 
approach, from the theory of generalized functions, defines it by its behavior inside 
integral operations. In fact, the 5-function is actually never supposed to exist outside 
an integral. In general, scientists and engineers are a bit more lax and use a second 
definition. They often define the 6-function as the limit of an infinite sequence of 
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-1/2n 1/2n 


Figure 5.5 The Square Sequence Function 


continuous functions. Also, as demonstrated in the two examples of the previous 
section, they frequently manipulate the 5-function outside of integrals. Usually, there 
are no problems with this less rigorous approach. However, there are some cases, 
such as the oscillatory sequence functions described at the end of this section, where 
the more careful integral approach becomes essential. 


5.2.1 &(0 as the Limit of a Sequence of Functions 


The 5-function can be viewed as the limit of a sequence of functions. In other words, 


5(t) = lims,.(), (5.10) 


where 6,,(f) is finite for all values of t. 
There are many function sequences that approach the Dirac 5-function in this way. 
The simplest is the square function sequence defined by 


_ fn —1f/2n<t< +1/2n 
Bult) = { 0 _ otherwise 6) 
and shown in Figure 5.5. Clearly, for any value of n 
/ dt 5,(t) = 1, (5.12) 


and in the limit as n — ©, 6,(t) = 0 for all t, except t = 0. The first three square 
sequence functions for n = 1,2, and 3 are shown in Figure 5.6. 


83(t)_ 
52(t) 


8,(0) — 


ee 


-1/2 -1/4 -1/6 1/6 1/4 1/2 


Figure 5.6 The First Three Square Sequence Functions 
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5,(t) 


Figure 5.7 The Resonance Sequence Function 


There are four other common function sequences that approach the Dirac 6- 
function. Three of them, the resonance, Gaussian, and sinc squared sequences, are 
shown in Figures 5.7—5.9 and are mathematically described by 


Resonance: 6,{f) = me 


Gaussian: ,(t) = “We (5.13) 


n 
—~e 
Ve 
sin? nt 
ate 


Sinc Squared: 6,(t) = 


Each of these functions has unit area for any value of n, and it is easy to calculate 
that in the limit as n — ©, 6,(f) = 0 for all t # 0. 

There is one other common sequence, the sinc function sequence, which ap- 
proaches the 5-function in a completely different manner. The discussion of this 
requires more rigor, and is deferred until the end of this section. 


| 8,(t) 


n/(m) 2 


I/n 


Figure 5.8 The Gaussian Sequence Function 
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t/n 


Figure 5.9 The Sinc Squared Sequence Function 


5.2.2 Defining 5(¢) by Integral Operations 


In mathematics, the 6-function is defined by how it behaves inside an integral. Any 
function which behaves as 8(t) in the following equation is by definition a 5-function, 


[ dt 8(t — t)) f(t) = a pres ee (5.14) 


otherwise 


where t_ < t, and f(#) is any continuous, well-behaved function. This operation 
is sometimes called a sifting integral because it selects the single value f(t.) out of 
f(t). 

Because this is a definition, it need not be proven, but its consistency with our 
previous definition and applications of the 6-function must be shown. Ift_ < ty < t+, 
the range of the integral can be changed to be an infinitesimal region of size 2€ centered 
around f,, without changing the value of the integral. This is true because S(t — t,) 
vanishes everywhere except at ¢ = t,. This means 


ty fo +e 
/ at 5(t — t.)f(@) = / dt S(t — t,) f(t). (5.15) 
fo to € 
Because f(t) is a continuous function, over the infinitesimal region it is effectively a 
constant, with the value f(t,). Therefore, 


' ” dt 8(t — to) f(t) = flte) dt 8(t—t) = fle). (5.26) 


This “proves” the first part of Equation 5.14. The second part, when ¢, is not inside 
the range t. < t < t+, easily follows because, in this case, 6(t — t,) is zero for the 
entire range of the integrand. 

Figure 5.10 shows a representation of the integration in Equation 5.14. The inte- 
grand is a product of a shifted 5-function and the continuous function f(t). Because 
the 5-function is zero everywhere except at t = 1,, the integrand goes to a 6-function 
located at t = f,, with an area scaled by the value of f(t,). 
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f(t) 


SUC MARLEE pea ict) 


Figure 5.10 A Graphical Interpretation of the Sifting Integral 


5.2.3 The Sinc Sequence Function 


The sinc function can be used to form another set of sequence functions which, in the 
limit as 2 — ©, approach the behavior of a 5-function. The sinc function sequence 
is shown in Figure 5.11 and is defined by 


sin nt 


Since: 6,(t) = (5.17) 
Notice, however, that as n — © this 5,(t) does not approach zero for all t # 0, so the 
sinc sequence does not have the characteristic infinitely narrow, infinitely tall peak 
that we have come to expect. How, then, can we claim this sequence approaches a 
6-function? 

The answer is that the sinc function approaches the 6-function behavior from a 
completely different route, which can only be understood in the context of the integral 
definition of the 6-function. In the limit as n — ©, the sinc function oscillates infinitely 
fast except at t = 0. Thus, when the sequence is applied to a continuous function 
f(t) in a sifting integral, the only contribution which is not canceled out by rapid 
oscillations comes from the point ¢ = 0. The result is, as you will prove in one of the 


| 8, (t) 


Figure 5.11 The Sinc Sequence Function 
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exercises at the end of this chapter, the same as you would expect for any 6-function: 


ss _ sinnt _ f f) t<0<t 
- dt [lim mt fO= t otherwise — (5.18) 
This means, in the limit n -> ©, the sinc sequence approaches the 6-function: 
_ sinnt _ 
jim aes 5(¢). (5.19) 


5.3. 6-FUNCTIONS WITH COMPLICATED ARGUMENTS 


So far, we have only considered the Dirac 5-function with either a single, independent 
variable as an argument, e.g., 5(z), or the shifted version 5(¢ — t,). In general, however, 
the argument could be any function of the independent variable (or variables). It turns 
out that such a function can always be rewritten as a sum of simpler 5-functions. 
Three specific examples of how this is accomplished are presented in this section, 
and then the general case is explored. In each case, the process is the same. The 
complicated 6-function is inserted into an integral, manipulated, and then rewritten 
in terms of 5-functions with simpler arguments. 


5.3.1 6(-d 


To determine the properties of 5(—1t), make it operate on any continuous function 
F(@ inside a sifting integral, 


tH 
/ dt 8(—t)f (0), (5.20) 
1 
where t. < t,. The substitution ¢’ = —t transforms this to 
ty -t- 
s i dt’ a(t')f(-t!) = | dt! a(t") f(—2') 
te ty 
x ee —t, <O0< -t. (5.21) 


0 otherwise : 


where the last step follows from the integral definition of 5(t) in Equation 5.14. 
Therefore, we have 


ty 
| dt &(—1) f(t) = gaa ae (5.22) 


otherwise 


But notice this is precisely the result obtained if a 5(#) were applied to the same 
function: 
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| * dt 8) f(t) = { ee (5.23) 


otherwise 


Remember, by definition, anything that behaves like a 5-function inside an integral 
is a §-function, so 6(—t) = 8(t). This implies the 5-function is an even function. 


5.3.2 8(at) 


Now consider another simple variation 5(ar), where a is any positive constant. Again, 
start with a sifting integral in the form 


ia dt f(t)d(at). (5.24) 


With a variable change of t' = at, this becomes 


t,/a i 
| dt! fi’ Jayg(t) _ { f(0)/a t_/a < 0<t,/a (5.25) 
eda a 0 otherwise 
so that 
a _ff@/a t<0<t 
[ dt f(t)8(at) = { 0 pent 72 (5.26) 


Notice again, this is just the definition of the function 6(t) multiplied by the constant 
1/a, so 


d(ax) = 8(x)/a a>0. (5.27) 


This derivation was made assuming a positive a. The same manipulations can be 
performed with a negative a, and combined with the previous result, to obtain the 
more general expression 


&(ax) = &(x)/lal. (5.28) 
Notice that our first example, 5(t) = 6(—1), can be derived from Equation 5.28 by 


setting a = —1. 


5.3.3 8? ~ a?) 


As a bit more complicated argument for the Dirac 5-function, consider 8(t? — a). 
The argument of this function goes to zero when = +a and t = —a, which seems 
to imply two 5-functions. To test this theory, place the function in the sifting integral 


i, : dt f(t)(t? — a’). (5.29) 
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There can be contributions to this integral only at the zeros of the argument of 
the 5-function. Assuming that the integral range includes both these zeros, i.e., 
t_ < —aand t, > +a, this integral becomes 


ate tate 
| dt f(t)8(t? ~ a’) + i dt f(t)5(t? — a”), (5.30) 
Tae +a-€ 
which is valid for any value of O << € <a. 

Now ?t? ~— a? = (t — a)(t + a), which near the two zeros can be approximated by 


(t + a)(—2a) t+ —a 


(= ada 24a R20 


P- a =¢-ae+a~{ 


Formally, these results are obtained by performing a Taylor series expansion of t* — a” 
about both t = —a and t = +a and keeping terms up to the first order. The Taylor 
series expansion of 7 ~ a? around an arbitrary point ¢, is, to first order, given by 


vn 
P-~a@~t—a+ aes 2) (t — ty). (5.32) 
dt 
t=t, 
In the limit as € — 0, Integral 5.30 then becomes 
ate tate 
i dt f(t)8(—2a(t + a)) + | dt f(t)8(2a(t — a)). (5.33) 
=a +t+a-€ 


Using the result from the previous section, 5(at) = 6(t)/lal, gives 
be 2 2 1 1 
dt f(r)6(t* — a“) = — f(—a) + —f (+a). (5.34) 
ye 2a 2a 
Therefore, 5(t? — a) is equivalent to the sum of two 6-functions: 
(t* — a“) = —6(t — a) + —6(t + a), (5.35) 
2a 2a 


as shown in Figure 5.12. 
: 5(x?2 — a’) 


area = 1/2a | area = 1/2a 


— = 


ig a 
Figure 5.12 The Plot of 8(x” — a’) 
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5.3.4 The General Case of 5(f(#)) 


Using the Taylor series approach, the previous example can be easily generalized to 
an arbitrary argument inside a 6-function: 


b(t — t;), (5.36) 


1 
a(f0) = > Lavaticr 


where the sum over i is a sum over all the zeros of f(t) and tf; is the value of t where 
each zero occurs. 


5.4 INTEGRALS AND DERIVATIVES OF 85(¢) 


Integrating the 5-function is straightforward because the 6-function is defined by its 
behavior inside integrals. But, because the 6-function is extremely discontinuous, 
you might think that it is impossible to talk about its derivatives. While it is true that 
the derivatives cannot be treated like those of continuous functions, it is possible to 
talk about them inside integral operations or as limits of the derivatives of a sequence 
of functions. These manipulations are sometimes referred to as 5-function calculus. 


5.4.1 The Heaviside Unit Step Function 


Consider the function H(x), which results from integrating the 6-function: 


A(x) = if at d(t). (5.37) 


-a 


H(x) can be interpreted as the area under 6(t) in the range from tf = -x tof = x, 
as shown in Figure 5.13. If x < 0, the range of integration does not include t = 0, 
and H(x) = 0. As soon as x exceeds zero, the integration range includes t = 0, and 
A(x) = 1: 


H(x) = | dt 8(t) = os pee (5.38) 


x 


Figure 5.13 The Integration for the Heaviside Step Function 
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Figure 5.14 The Heaviside Function 


Strictly, the value of H(0) is not well defined, but because 8(t) is an even function, 
many people define H(0) = 1/2. A plot of the Heaviside function is shown in 
Figure 5.14. 

Ironically, the function does not get its name because it is heavy on one side but, 
instead, from the British mathematician and physicist, Oliver Heaviside. 


5.4.2 The Derivative of 5(4) 


The derivatives of 5(t) are, themselves, very interesting functions with sifting prop- 
erties of their own. To explore these functions, picture the first derivative of 5(f) as 
the limit of the derivatives of the Gaussian sequence functions: 


dod(t) ae d6,(t) _ li d n al ; (5.39) 


1m 
at no dt ne dt 


Figure 5.15 depicts the limiting process. 

The first derivative of 5(t), often written 5/(t), is called a “doublet” because of 
its opposing pair of spikes, which are infinitely high, infinitely narrow, and infinitely 
close together. Like 5(t), the doublet is rigorously defined by how it operates on other 
functions inside an integral. The sifting integral of the doublet is 


i. dt f(t)d(t), (5.40) 


ft 


d6,(t)/dt 


Figure 5.15 The Derivative of 5(t) 
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where f(t) is any continuous, well-behaved function. This integral can be evaluated 
using integration by parts. Recall that integration by parts is performed by identifying 
two functions, u(t) and v(t), and using them in the relation 


“b t=b b 
i d(u(v(e) = ulryo| = i) d(v(t))u(t). (5.41) 


To evaluate Equation 5.40, let v(t) = f(t) and du = 6'(t)dt. Then dv = (df/dt) dt 
and u(t) = 6(f) so that Equation 5.40 becomes 


/ dt f(08"(t) = 00) ie = i dt LO 500, (5.42) 


Ift. # Oandr, # 0, Equation 5.42 reduces to 
te T+ d 
/ dt f(t)6'(t) = aa dt dt FO xn, (5.43) 
te y 


because 4(t) is zero for t # 0. This is now in the form of the standard sifting integral, 
so 


1. . { ~f'0) t<0<KH 
‘ BOSS 10 otherwise Oe) 
The doublet is the sifting function for the negative of the derivative. 

This sifting property of the doublet, like the sifting property of 5(t), has a graphical 
interpretation. The integrated area under the doublet is zero, so if it is multiplied by a 
constant, the resulting integral is also zero. However, if the doublet is multiplied by 
a function that has different values to the right and left of center (i.e., a function with 
a nonzero derivative at that point), then one side of the integrand will have more area 
than the other. If the derivative of f(t) is positive, the negative area of the doublet is 
scaled more than the positive area and the result of the sifting integration is negative. 
This situation is shown in Figure 5.16. If, on the other hand, the derivative of f(1) 


f(t) d8,(t)/dt 


d8,(t)/dt 


Figure 5.16 The Sifting Property of the Doublet 
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is negative, the positive area of the doublet is enhanced more than the negative area, 
and the sifting integration produces a positive result. 

Similar arguments can be extended to higher derivatives of the 5-function. The 
appropriate sifting property is arrived at by applying integration by parts a number 
of times, until the integral is placed in the form of Equation 5.44. 


5.5 SINGULAR DENSITY FUNCTIONS 


One of the more common uses of the Dirac 5-function is to describe density functions 
for singular distributions. Typically, a distribution function describes a continuous, 
“per unit volume” quantity such as charge density, mass density, or number density. 
Singular distributions are not continuous, but describe distributions that are confined 
to sheets, lines, or points. The example of the density of a point mass was briefly 
described earlier in this chapter. 


5.5.1 Point Mass Distributions 


The simplest singular mass distribution describes a point of mass m, located at the 
origin of a Cartesian coordinate system, as shown in Figure 5.4. The mass density 
function p,,(x, y,z) that describes this distribution must have the units of mass per 
volume and must be zero everywhere, except at the origin. In addition, any volume 
integral of the density which includes the origin must give a total mass of m,, while 
integrals that exclude the origin must give zero mass. The expression 


Pm(X, Y,Z) = M8(x)8(y)8(Z) (5.45) 


satisfies all of these requirements. Clearly it is zero unless x, y, and z are zero. 
Therefore integrating p,, over a volume that does not include the origin produces 
zero. Integrating over a volume that contains the origin results in 


+é€ +€ +€ 
far m,8(x)5(y)8(z) -/ ax | dy [ dz Mm 8(x)5(y)8(z) 
Vv ~e SE —€ 
= Mp. (5.46) 


Because { dx 5(x) = 1, 5(x) has the inverse dimensions of its argument, or in this 
case, 1/length. Therefore, m,5(x)5(y)5(z) has the proper dimensions of mass per 
unit volume. 

To shift the point mass to a different location than the origin, simply use shifted 
5-functions: 


Pm = M8(x — x_)5(y — Yo) S(Z — Zo). (5.47) 


This function has the proper dimensions of mass per unit volume, and p,, is zero 
except at the point (x, yo, Z,). The integral over a volume V correctly produces zero 
mass if V does not include the point (Xp, yo, Z.) and mz if it does. 
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Consider the very same point, but now try to write the mass density in cylindrical 
coordinates, i.€., Pm(p, &, z). In this system, the coordinates of the mass point are 


(po, do, Zo) where 
Po = /%5 + ¥6 


bo = tan”! (yo/Xxp) (5.48) 
Zo = Zo. 
A natural guess for p»(p, b, 2) might be 


m,5(p — po)5(b ~ bo)5(Z — Zo). (5.49) 


This density clearly goes to zero, unless p = po, & = do and z = Z, as it should. 
However, because @ is dimensionless, Equation 5.49 has the dimensions of mass per 
unit area, which is not correct. Also, the integral over a volume V becomes 


/ dr pe = y ap iy pd - di nSlp = pO S)SE=W. “650 
Vv 


This is zero if V does not contain the point (p,, $o, 2), but when V does contain the 
point, the result is m7,po, not m, as required. This is because the dp integration gives 


Pot 
i dp p8(p — Po) = Po. (5.51) 
p. 


lp € 


Therefore, Equation 5.49 is not the proper expression for the mass density in cylin- 
drical coordinates, because it does not have the proper dimensions, and integration 
does not produce the total mass. From the discussion above, it is clear that the correct 
density function is 


pa lp — po)8( — $0)8(z — Zo). (5.52) 


This example demonstrates an important point. While the point distributions in 
a Cartesian system can be determined very intuitively, a little more care must be 
used with non-Cartesian coordinates. In generalized curvilinear coordinates, where 
dt = hyhyh3dq,dqzdq;3, the expression for a point mass at (410, 920, 930) IS 


8(41 — Gio) §(g2 — 920) 6(43 — 930) 


5.5 
hy hy hy OP?) 


Pm(41+ 42,93) = Mo 


There is a common shorthand notation used for three-dimensional singular distri- 
butions. The three-dimensional Dirac 6-function is defined by 


ee 5(41 — 410) 8(92 = 920) 6(92 ~ 920) 
3 os 
6°(F —T,) = eer ace ha hi : (5.54) 
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In Cartesian coordinates, this is simply 
85°F — Fo) = 8(x — x)8(y — Yo)S(Z — Zo): (5.55) 


Using this notation, the mass density of point mass located at r, is, regardless of the 
coordinate system chosen, given by 


Pm(F) = m,8°(F — F,). (5.56) 


If there are several points with mass m; at position r;, the density function becomes 
a sum of 5-functions: 


Pmt) = Sm d°(F — F)). (5.57) 
Integrating over a volume V gives 


dt) m&(F — Fi), (5.58) 
ea: 
which, when evaluated, is simply the sum of all the masses enclosed in V. 


5.5.2 Sheet Distributions 


Imagine a two-dimensional planar sheet of uniform mass per unit area o,, located in 
the z = z, plane of a Cartesian system, as shown in Figure 5.17. The intuitive guess 
for the mass density p,, in Cartesian coordinates is 


Pm(X, y,Z) = Gy 5(Z — Zo). (5.59) 


The 6-function makes sure all the mass is in the z = z, plane. The dimensions are 
correct, because o, has the dimensions of mass per area, and the 5-function adds 
another 1 /length, to give p,, the dimensions of mass per unit volume. The real check, 


Figure 5.17 Simple Infinite Planar Sheet 
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however, is the volume integral. In order for Equation 5.59 to be correct, an integral 
of the surface mass density over some part of the sheet § must give the same total 
mass as a volume integral of p,,, over a volume V, which encloses S: 


| do Om(F) = i dT pm(F)- (5.60) 
AY Vv 


For the case described above, Equation 5.60 expands on both sides to 


ih ax f ay Oo = fa fay fac O8(Z — 2); (5.61) 


which is indeed true, because the range of the z integration includes the zero of 
the 5-function. Thus the assumption of Equation 5.59 was correct. Notice how the 
5-function effectively converts the volume integral into a surface integral. 
Unfortunately, things are not always this easy. The previous example was particu- 
larly simple, because the sheet was lying in the plane given by z = z,. Now consider 
the same sheet positioned in the plane y = x, as shown in Figure 5.18. In this case 
the mass is only located where y = x, and the intuitive guess for the mass density is 


O5(y — x). (5.62) 


But our intuition is incorrect in this case. The surface integral on the LHS of Equa- 


tion 5.60 expands to 
eo On(T) = fos [a Do, (5.63) 


where ds is the differential length on the surface in the xy-plane, as shown in Fig- 
ure 5.19. Notice that 


ds = \/(dx)? + (dy? 
= /2dx, (5.64) 


Figure 5.18 A Tilted Planar Mass Sheet 
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Figure 5.19 The Differential Length ds Along the Surface x = y in the xy-Plane 


where the last step follows because dx = dy for this problem. Thus Equation 5.63 
becomes 


F, do om(F) = V2 / dx / dz Go. (5.65) 


If we use the expression in 5.62 for p,,, the volume integral on the RHS of Equa- 


tion 5.60 is 
[are = [a [ay [ac od(y — x) 
= [ax [ deas, (5.66) 


where the last step results from performing the integration over y. Comparing Equa- 
tions 5.66 and 5.65, we see that they are off by a factor of J ! 

Where does this discrepancy come from? The problem was our assumption in 
Equation 5.62 to use a 6-function in the form 5(y — x). We could have very well 
chosen [7]8(y — x), where [?] is some function that we need to determine. This still 
makes all the mass lie in the y = x plane. The correct choice of [?] is the one that 
makes both sides of Equation 5.60 equal. For example, when we make our “guess” 
for the distribution function for this example, we write 


Pm(x, YZ) = [2]oo5(x — y). (5.67) 


Then, when we evaluate the RHS of Equation 5.60, we get 


[arom = [ax [ay [az ont ~ 0 
= [dxf del (5.68) 


In this case, we already showed that the value of [?] is simply the constant BOs In 
more complicated problems, [?] can be a function of the coordinates. This can happen 
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if either the mass per unit area of the sheet is not constant, or if the sheet is not flat. 
You will see some examples of this in the next section and in the problems at the end 
of this chapter. 


5.5.3 Line Distributions 


As a final example of singular density functions, consider the mass per unit volume 
of a one-dimensional! wire of uniform mass per unit length A,. The wire is bent to 
follow the parabola y = Cx? in the z = 0 plane, as shown in Figure 5.20. The factor 
C is a constant, which has units of 1/length. We will follow the same procedure in 
constructing the mass density for this wire as we did for the previous example. In this 
case, the volume integral of the mass density must collapse to a line integral along 
the wire 


[a Pm(T) = fa Am(5). (5.69) 
Vv L 


In this equation, s is a variable which indicates parametrically where we are on the 
wire, and A,,(s) is the mass per unit length of the wire at the position s. 
Because all the mass must lie on the wire, we write the mass density function as 


Pm(X.¥,2) = [MA.d(y — Cx?)8(z). (5.70) 


Here we have used two 5-functions. The 5(z) term ensures all the mass lies in the z = 0 
plane, while &(y — Cx”) makes the mass lie along the parabola. As before, we include 
an unknown factor of [?], which we will have to determine using Equation 5.69. 

In terms of Cartesian coordinates, the general expression for the differential arc 
length ds is 


ds = \/(dxP + (dyY + (do. (5.71) 


Figure 5.20 Parabolic Line Distribution 
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Along the wire, z = 0 and y = Cx’, so that 


ds = V1 + 4C2x? dx. (5.72) 


Using Equation 5.70 and the fact that dr = dx dy dz, Equation 5.69 becomes 
if dx ‘| dy if dz [2]A,8(y — Cx*)8(z) = } dx V1 + 4C2x? Ag. (5.73) 
Let’s concentrate on the LHS of this equation. The integral over z is easy, because 
ja 6(z) = 1. (5.74) 


Also, when we do the integral over y, x is held constant, and only one value of y 
makes the argument of the 5-function vanish, so we have 


/ dy &(y — Cx’) = 1. (5.75) 
Therefore, Equation 5.73 becomes 
[]A, = [a V1+4C2x2 A,, (5.76) 


and the value of [?] clearly must be 


[7] = V1 4+ 4C2x?. (5.77) 


Notice in this case that [?] is a function of position. The mass density for the wire is 
given by 


Pm(X, Ys 2) = V1 + 4C2x? A,8(y — Cx”) 8(z). (5.78) 


When we converted the volume integral in Equation 5.73 to a line integral, it was 
easier to perform the dy integration before the dx integration. This was because when 
we integrated over y, holding x fixed, the only value of y which made the argument 
of the 5-function zero was y = Cx”. Another way to look at this is that there is a 
one-to-one relationship between dx and ds, as shown in Figure 5.21. 

If instead, we performed the x integration first, holding the value of y fixed, there 
are two values of x which zero the 5-function argument. In this case, the integral 
becomes 


[480 - cx) = = f ax 8 ([x~ v97E ][x+ Vy/C |) 


c 


cs : [a [(x- V9/E) + 8(x + vy/C )]. (5.79) 


4Cy 
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dx dx 
Figure 5.21 The Relation Between dx and ds Along the Parabola 


Figure 5.22 The Relation Between dy and ds Along the Parabola 


Looking at Figure 5.22 shows what is going on here. There is not a one-to-one 
relationship between dy and ds. Of course, evaluating the integral in this way produces 
exactly the same result for p,,(F), as you will prove in one of the exercises of this 
chapter. 


5.6 THE INFINITESIMAL ELECTRIC DIPOLE 


The example of the infinitesimal electric dipole is one of the more interesting appli- 
cations of the Dirac 6-function and makes use of many of its properties. 


5.6.1 Moments of a Charge Distribution 


In electromagnetism, the distribution of charge density in space p,(F) can be ex- 
panded, in what is generally called a multipole expansion, into a sum of its moments. 
These moments are useful for approximating the potential fields associated with com- 
plicated charge distributions in the far field limit (that is, far away from the charges). 
Each moment is generated by calculating a different volume integral of the charge 
distribution over all space. Because our goal is not to derive the mathematics of 
multipole expansions, but rather to demonstrate the use of the Dirac 5-functions, the 
multipole expansion results are stated here without proof. Derivations can be found 
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in Most any intermediate or advanced book on electromagnetism, such as Jackson’s 
Classical Electrodynamics. 

The lowest term in the expansion is a scalar called the monopole moment. It is 
just the total charge of the distribution and is determined by calculating the volume 
integral of p,: 


Q= dt p-(f). (5.80) 


All space 


The next highest moment is a vector quantity called the dipole moment, which is 
generated from the volume integral of the charge density times the position vector: 


p= i dt ¥p,(F). (5.81) 
All space 


The next moment, referred to as the quadrapole moment, is a tensor quantity generated 
by the integral 


Q= " dr (3rF — [rl?i )pc(r). (5.82) 
All space 


In this equation, the quantity F F is a dyad, and 1 is the identity tensor. There are an 
infinite number of higher-order moments beyond these three, but they are used less 
frequently, usually only in cases where the first three moments are zero. 

Far away from the charges, the electric potential can be approximated by summing 
the contributions from each of the moments. The potential field ® due to the first few 
moments is 


Be 1 |O F¥-P F-Q:-F 
= ~ + + ase! 
©) 4m€, |r r3 ars oe) 


It is quite useful to know what charge distributions generate just a single term in 
this expansion, and what potentials and electric fields are associated with them. For 
example, what charge distribution has just a dipole term (that is, P # 0) while all 
other terms are zero (O = 0, Q = 0, etc.). The Dirac 6-function turns out to be quite 
useful in describing these particular distributions. 


5.6.2 The Electric Monopole 


The distribution that generates just the Q/r term in Equation 5.83 is called the electric 
monopole. As you may have suspected, it is simply the distribution of a point charge 
at the origin: 


Prone = God (F). (5.84) 
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Its monopole moment, 
Q= dt qo5°(F) = Go, (5.85) 
All space 


is simply equal to the total charge. The dipole, quadrapole, and higher moments of 
this distribution are all zero: 


P= | dt gor 5°(F) = 0 (5.86) 
All space 


Q= dr (3Fr — r°1)d*(r) = 0. (5.87) 


All space 


The electric field of a monopole obeys Coulomb’s law: 


(5.88) 


5.6.3 The Electric Dipole 


An electric dipole consists of two equal point charges, of opposite sign, separated by 
some finite distance d,, as shown in Figure 5.23. The charge density of this system, 
expressed using Dirac 5-functions, is 


Papi = do 3 (r oe a) = 5 ¢ + $2) ’ (5.89) 


where in this case, the dipole is oriented along the x-axis. 
You might be tempted to believe that this distribution has only a dipole moment. 


Indeed both Q and Q are zero, while the dipole moment is given by 


= d d 
P= yisc (ne “26 —~ §° (Ft 7é 
fa ok f (= 5 é.) 6 (- 5 é.)| 


ody odo » 
= 40 28 + te &, (5.90) 


= dod ky. 


The higher-order moments for the dipole distribution, however, do not vanish. 


Figure 5.23. The Electric Dipole 
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The “ideal” dipole refers to a charge distribution that has only a dipole moment, 
and no other. It is the limiting case of the “physical” dipole, described above. In 
this limit, the distance between the charges becomes vanishingly small, while the net 
dipole moment P is held constant. In other words, d) —> 0 and g, — © such that 
doGo = Po is held constant. The charge distribution for this situation is 


d d 
= PONS ia Og. Vo BS gee BOR) : 
Picen = jim * re ab (*- 5 é.) 6 (= 5 é.)] (5.91) 
Expansion of the 5-functions in terms of Cartesian coordinates gives 
. | d, do 
eat = — ~—}- + )). ; 
Picea = Pod(y)5(Z) Jim, Z, 3 (: 5 ) ce) (: 5) )| (5.92) 
Notice that this limit is just the definition of a derivative: 
ac al d, do d8(x) 
lim — -s+}- +—j)}=- , : 
dp70 d, 3 ( 2 ) (: 2 )| dx O78) 


This means the charge distribution of an ideal dipole, with a dipole moment of 
magnitude p, oriented along the x axis, can be written 


oo 


Prtear = —Po 5(y)8(z). (5.94) 
The electric field from the ideal dipole (and also from a physical dipole in the far 
field limit) is the solution of 


7-E- ©) _ pedis 


E : 5(y)S(2). (5.95) 


But we already know the solution for the electric monopole is 


0. 1 Qo®, _ do 
¥ ve } = POO) 8). (5.96) 


ae 


Operating on both sides of Equation 5.96 with [—d,0/dx], shows the electric field in 
Equation 5.95 obeys 


Rp d 1 Gob, 
E= : 5.97 
doa 4ae, r2 } er 


Taking the derivative gives the result: 


E= —?e { 4-2} (5.98) 
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This is the solution for a dipole oriented along the x-axis. In general, the dipole is a 
vector P, which can be oriented in any direction, and Equation 5.98 generalizes to 
= J 2 £- 
E = — , j3(P -é,)é, — P}. 5.99 
meee ( ) } Oz) 
Notice that the magnitude of this field drops off faster with r than the field of 
the monopole. The higher the order of the moment, the faster its field decays with 
distance. 


5.6.4 Fields from Higher-Order Moments 


The technique described above can be used to find the electric field and charge 
distributions for the ideal electric quadrapole, as well as for all the higher moments. 
The charge distributions can be constructed using 5-functions, and the fields can be 
obtained by taking various derivatives of the monopole field. You can practice this 
technique with the quadrapole moment in an exercise at the end of this chapter. 


5.7 RIEMANN INTEGRATION AND THE DIRAC 6-FUNCTION 


The §-function provides a useful, conceptual technique for viewing integration which 
will become important when we discuss Green’s functions in a later chapter. From 
Equation 5.14, the sifting integral definition of the 5-function, any continuous func- 
tion f(y) can be written 


+a 
fo) = [dxdt feo, (5.100) 
The Riemann definition of integration says that an integral can be viewed as the limit 
of a discrete sum of rectangles, 


n= +00 


+x 
/ ; dx g(x) = ee g(nAx)Ax, (5.101) 


=—00 


where Ax is the width of the rectangular blocks which subdivide the area being 
integrated, and n is an integer which indexes each rectangle. The limiting process 
increases the number of rectangles, so for well-behaved functions, the approximation 
becomes more and more accurate as Ax — 0. The Riemann definition is pictured 
graphically in Figures 5.24(a) and (b). 

Using the Riemann definition, Equation 5.100 becomes 


FO) i dx f(x)8(x — y) 


ro) 
n=+20 


lim 2s f(nAx)8(nAx — y)Ax. (5.102) 
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»" 
(a) Ax) 
Figure 5.24 Discrete Sum Representation of an Integral 
area of n“ 6- function 
= f(nAx)Ax 
f(y) --- | £ 
i 7 
: f(nAx)Ax |8(y — nAx 
, > ltlaax)ax]ely ~ nsx) 
y 


Ax 
Figure 5.25 The Construction of f(y) from an Infinite Sum of 6-Functions 


Equation 5.102 makes a very interesting statement: Any continuous function can be 
viewed as the sum of an infinite number of 5-functions. Both f(y) and the RHS of 
Equation 5.102 are functions of y, and are plotted in Figure 5.25 for finite Ax. In 
this figure, the 5-functions are located at y = nAx and have an area FCnAx)Ax. 
The function f(y) is generated by this sum of 6-functions as Ax — 0, i.e., as the 
spacing between the 6-functions and their areas go to zero. Thus an infinite number 
of infinitesimal area 5-functions, spaced arbitrarily close together, combine to form 
the continuous function f(y)! 


EXERCISES FOR CHAPTER 5 


1. Simplify the integral 


| dx f(x)6(—ax + b), 


where a and b are real, positive constants. 
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2. Perform these integrations which involve the Dirac 6-function: 

0 

i. / dx &(x — 1). 
~3 

i | dx (x? + 3)8(x — 5). 
5 

ui. f dx x 5(x* — 5). 
-5 


5 
iv. / dx x (x? + 5). 
-5 


Qn 
v [ dx &(cos x). 
0 


3/4 
vi. | dx x* 8(cos x). 
a/4 


vii. ‘ dx x? ge) 
a dx — 


10 us 
viii. / ax 2 + 3)[SC—9. 


-10 dx 


3. Determine the integral properties of the “triplet” d?5(t)/dt? by evaluating the 
integrals 


. fo (ade) 

i. a ar ( an ). 

ae d?8(t — t) 

H. [4 fo (HE) : 


4. The function A(x) is generated from the function g(x) by the integral 


h(x) = ie dx' ato ae =@) 


If h(x) is the triangular pulse shown below, find and plot g(x). 


» h(x) 
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5. Given that 
_ fo x<0O 
fo) = 4. x>0’ 
find and plot the first and second derivatives of f(x). 


6. An ideal impulse moving in space and time can be described by the function, 
F(x, t) = [,6(x — vot), 


where v, is a constant. Make a three-dimensional plot for f(x, t) vs. x and t to 
show how this impulse propagates. What are the dimensions of v,? How does 
the plot change if 


f(x, 0) = 1,8(x — aot” /2), 
where a, is a constant? What are the dimensions of a,? 
7. Consider the sinc function sequence: 


6,(t) = sin (n7)/(ax). 


(a) On the same graph, plot three of these functions with n = 1, n = 10, and 
n = 100. 

(b) Prove that the limit of the sinc sequence functions acts like a 6-function by 
deriving the relation: 


i at [im mm f(t) = vm t<O<t : (5.103) 


ot otherwise 


AS a first step, try making the substitution y = nt. You will need to use the 
identity: 


8. The function 5(cos x) can be written as a sum of Dirac 5-functions 


6(cos x) = So an 8(x — Xn). 


Find the range for n and the values for the a, and the x,. 


9. A single point charge q, is located at (1, 1,0) in a Cartesian coordinate system, 
so that its charge density can be expressed as 


P(X, ¥,Z) = Go B(x — 1) 6(y — 1) &(2). 
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(a) What is p.(p, 9, z), its charge density in cylindrical coordinates? 
(b) What is p.(r, 0, ), its charge density in spherical coordinates? 


10. An infinitely long, one-dimensional wire of mass per unit length A, is bent to 
follow the curve y = A cosh (Bx), as shown below. 


ly 


Find an expression for the mass per unit volume p(x, y, z). Express your answer 
two ways: 


(a) As the product of two 6-functions. 
(b) As a sum of two terms, each the product of two 6-functions. 
Be sure to check the dimensions of your answers. 
11. An infinitely long, one-dimensional wire of mass per unit length A, is bent to 
follow the line formed by the intersection of the surface x = y with the surface 


y = 2, as shown in the figure below. Find an expression for p,,(x, y, z), the mass 
per unit volume of the wire. 
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12. An infinitely long, one-dimensional wire with a constant mass per unit length A, 
is bent to follow the curve y = sin x in the z = 0 plane. 


Determine the mass density p,,(x, y, z) that describes this mass distribution. 


13. A wire of mass per unit length A, is bent to follow the shape of a closed ellipse 
that lies in the xy-plane and is given by the expression 


e+ 2y? = 4, 


Express p,p,(x, y,z), the mass per unit volume of this object, using Dirac 6- 
functions. Show that your expression has the proper dimensions. There is more 
than one way to express the answer to this problem. Identify the most compact 
form. 


14. An infinite, one-dimensional bar of mass per unit length A, lies along the line 
y = mx in the z = 0 plane. 


(a) Determine the mass per unit volume p(x, y, z) of this bar. 

(b) Now consider the situation where the bar is rotating about the z-axis at a 
constant angular velocity w, so that the angle the bar makes with respect to 
the x-axis is given by 6 = w,f, as shown below. Find an expression for the 
time-dependent mass density p(x, y, z, f). 
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15. 


16. 


17. 


18. 


A charge Q, is evenly distributed along the x-axis fromx = —L,/2tox = L,/2, 
as shown below. 


(a) Using the Heaviside step function, what is the charge density p,(x, y, 2), 
expressed using Cartesian coordinates? 


(b) What is this charge density in cylindrical coordinates? 


Using 6-functions and the Heaviside step function, express the charge density 
p-(F) of a uniformly charged cylindrical shell of radius r, and length L,. The 
total charge on the surface of the shell is Q,. 


An infinite, two-dimensional sheet with mass per unit area o, is bent to follow 


the surface y = x°. 


(a) Make a plot of the curve made by the intersection of this sheet and the plane 
given by z = 0. 

(b) Determine the mass per unit volume p,,(x, y, Z)- 

Express the mass density p,,(p, 9, z) of a conical surface that is formed by cutting 

a pie-shaped piece from an infinite, uniform two-dimensional sheet of mass per 

unit area o, and joining the cut edges. The conical surface that results lies on the 

surface p = a, z where (p, 9, z) are the standard cylindrical coordinates. 


aie 
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19. An infinite, two-dimensional sheet with mass per unit area a, is bent to follow 

the surface xy = 1 in a Cartesian coordinate system. 

(a) Using the hyperbolic coordinates developed in Exercise 13 of Chapter 3, 
express the mass density p,,(u, v, z) for this sheet. 

(b) Using the equations relating the coordinates, convert your answer to part (a) 
above to Cartesian coordinates. 

(c) Now, working from scratch in a Cartesian system, obtain p(x, y,z) by 
requiring that this density function take the volume integral over all space to 
a surface integral over the hyperbolic surface. 


20. Express the mass density p,,(¥) for a spherical sheet of radius r,, with constant 
mass per unit area Oy. 


21. A dipole electric field is generated outside the surface of a sphere, if the charge 
per unit area on the surface of that sphere is distributed proportionally to cos(6). 
If the sphere has a radius r, and there is a total charge of +@Q, on the upper 
hemisphere and —Q, on the lower hemisphere, what is the expression for the 
charge density p,(r, 0, @) in spherical coordinates? 


22. In a two-dimensional Cartesian coordinate system, the mass density p,,(F) of a 
pair of point masses is given by 


Pm(X1, X2) = mo8(x1 + 1)8(x2) + m.6(%1 — 1)8(x2). 


Evaluate the integrals: 


i. | ,) dx dx2 Pm(%1, x2). 
i | , dx, dx2 T pm(x1, X2). 
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iii, i i. dx ,dx> {r : Yj Pm(%1; X2)- 


iv. / / dx, dx2 [FF] pm(%1, x2). 


23. Prove that the monopole and quadrapole moments of any dipole (“physical” or 
“ideal’”’) are zero. 


24. A quadrapole charge distribution consists of four point charges in the x,x2-plane 
as shown below. 


-~q,@ 1: e744, 


(a) Using Dirac 6-functions express the charge density, p-(x1, x2,x3), of this 
distribution. 

(b) The quadrapole moment of this charge distribution is a second rank tensor 
given by 


Q = Oi G8). 


The elements of the quadrapole tensor are given by the general expression 


Oi; — / dx; i dx, 2 dx; Pc(X1, X2, x3) [3xix; ay (xp.4)8i; | : 


where 6;; is the Kronecker delta. In particular, 


Qa = / dx; / dey | xs pe(X1, X24X3) (3x2x2 — (xf + xb + 35))- 


Evaluate all the elements of the quadrapole tensor for the charge distribution 
shown in the figure above. 

(c) Does this charge distribution have a dipole moment? 

(d) Find the coordinate system in which this quadrapole tensor is diagonal. 
Express the elements of Q in this system. 


25. An ideal quadrapole has a charge density p(x, y, y) that is zero everywhere 
except at the origin. It has zero total charge, zero dipole moment, and a nonzero 
quadrapole moment. 
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(a) Show that if p(x, y, z) has the form 


= d8(x) d&(y) 
Pc(x, Y, 2) I “7. &(2), 


it satisfies the above requirements for an ideal quadrapole. Evaluate [?] so 
that the elements of the quadrapole moment tensor for this distribution are 
the same as the quadrapole elements in Exercise 24. 


(b) Determine the electric field produced by this ideal quadrapole. 


6 


INTRODUCTION TO 
COMPLEX VARIABLES 


One of the most useful mathematical skills to develop for solving physics and en- 
gineering problems is the ability to move with ease in the complex plane. For that 
reason, two chapters of this book are devoted to the subject of complex variable 
theory. This first chapter is primarily restricted to the fundamentals of complex vari- 
ables and single-valued complex functions. Some knowledge of complex numbers is 
assumed. 


6.1 A COMPLEX NUMBER REFRESHER 


Complex numbers are based on the imaginary number “i”, which is defined as the 
positive square root of —1: 


i= /-1. (6.1) 
In general, a complex number z has a real and an imaginary part, 

zZ=xtiy, (6.2) 
where x and y are both real numbers. If y is zero for a particular complex number, 


that number is called pure real. If x is zero, the number is pure imaginary. 


6.1.1 The Complex Plane 


A complex number can be plotted as a point in the complex plane. This plane is 
simply a two-dimensional orthogonal coordinate system, where the real part of the 
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Figure 6.1 A Plot of the Complex Number z = x + iy in the Complex Plane 


complex number is plotted on the horizontal axis and the imaginary part is plotted on 
the vertical axis. A plot of the complex number z = x + iy is shown in Figure 6.1. 
The magnitude of the complex number z = x + iy is 


Iz] = x? + y?, (6.3) 


which is simply the distance from the origin to the point on the complex plane. 


6.1.2 Complex Conjugates 


The conjugate of a complex number can be viewed as the reflection of the number 
through the real axis of the complex plane. If z = x + iy, its complex conjugate is 
defined as 

Zz =x— iy. (6.4) 


The complex conjugate is useful for determining the magnitude of a complex 
number: 


zz" = (x + iy)(x — iy) = x7 + y? = [gI”. (6.5) 


In addition, the real and imaginary parts of a complex variable can be isolated using 
the pair of expressions: 


(6.6) 


tz @tiy) + (iy) _ 
2 2 : 
g-z2 _ @ti)- Oy) _ 
2i 2i e 


6.1.3 The Exponential Function and Polar Representation 


There is another representation of complex quantities which follows from Euler’s 
equation, 


e” = cos@ + isiné. (6.8) 


To understand this expression, consider the Taylor series expansion of e* around zero: 
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fae d(e*) 1@e*) Bact. d>(e*) 5 
x=0 dx oe 2! dx? | | BN ax |. 
=1+x+t a + x +-- 
ES pee ae 7 (6.9) 


Now we make the assumption that this Taylor series is valid for all numbers including 
complex quantities. Actually, we define the complex exponential function such that 
it is equivalent to this series: 


$e pee, (6.10) 


Now let z = 76 to obtain 


-y2 -g\3 “a4 
i9y Go | GO 


io _ . 
ev =1+i0+ T + ry rm (6.11) 
Breaking this into real and imaginary parts gives 
; @.. 6? -. “98 e eo @ 
i@ ee Di ie Eero 
é 1 T + ai Gl + +il0 31 + 517i + . (6.12) 


The first bracketed term in Equation 6.12 is the Taylor series expansion for cos 6, and 
the second is the expansion for sin 6. Thus Euler’s equation is proven. 

Using this result, the complex variable z = x + iy can be written in the polar 
representation, 


= re 


tN 


= rcos@ + irsin@, (6.13) 


where the new variables r and @ are defined by 


r= (zl = x2 + y? (6.14) 
6 = tan! y/x. (6.15) 


The relationship between the two sets of variables is shown in Figure 6.2. 


imag 


Trea] 


Figure 6.2 The Polar Variables 
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6.2 FUNCTIONS OF A COMPLEX VARIABLE 


Just as with real variables, complex variables can be related to one another using 
functions. Imagine two complex variables, w and z. A general function relating these 
two quantities can be written as 


w = w(2). (6.16) 
If w = « + iv andz = x + iy, then we can rewrite this relationship as 
w(z) = u(x, y) + iv(x, y), (6.17) 


where u(x, y) and v(x, y) are both real functions of real variables. 
As a simple example of a complex function, consider the relationship 


w(z) = 2. (6.18) 
Expanding z = x + iy gives 
w(z) = (x? — y?) + ixy. (6.19) 
The functions u(x, y) and v(x, y) from Equation 6.17 are easily identified as 
u(x, y) = x* — y* (6.20) 
u(x, y) = 2xy. (6.21) 


6.2.1 Visualization of Complex Functions 


A relationship between w and z can be visualized in two ways. The first interpretation 
is as a mapping of points from the complex z-plane to the complex w-plane. The 
function w = z maps the point z = 2i into the point w = —4 as shown in Figure 6.3. 
Another point z = 1 + i maps to w = 2i. By mapping the entire z-plane onto the 
w-plane in this way, we can obtain a qualitative picture of the function. 

A second, more abstract visualization is simply to plot the two functions u(x, y) 
and v(x, y) as the real and imaginary components of w. This method is shown in 
Figure 6.4 for the function w = sin z. 


* z-plane jx) 
w-plane 
@ Zo = 2i W.= 2 2 
‘ =a so | 
wWo= —4 | 
x u 
got age ea 


Figure 6.3 The Mapping Property of a Complex Function 
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Figure 6.4 The Plots of the Real and Imaginary Parts of w(z) as Functions of x and y 


6.2.2 Extending the Domain of Elementary Functions 


The elementary functions of real variables can be extended to deal with complex 
numbers. One example of this, the complex exponential function, was already intro- 
duced in the previous section. We defined that function by extending its Taylor series 
expansion to cover complex quantities. Series expansions of complex functions are 
covered in much greater detail later in this chapter. 


Trigonometric Functions We can perform a similar extension with the sine func- 
tion. The Taylor series of sin(x) expanded around zero is 
3 5 7 
pe ee ee 
sminx = x 31 5 a 
where x is a real variable. The complex sine function can be defined by extending 
this series to complex values, 


(6.22) 


(6.23) 


where z is now a complex quantity. This series converges for all z. Similarly, the 
complex cosine function has the series 
2 4 6 
Coser he ae at er (6.24) 
which also converges for all values of z. 
With these extended definitions, Euler’s equation (Equation 6.8) easily generalizes 
to all complex quantities: 


e= = cosz + isinz. (6.25) 


This allows the derivation of the commonly used expressions for the sine and cosine 
of complex variables: 


(6.26) 


(6.27) 
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The other trigonometric functions can be extended into the complex plane using the 
standard relationships. For example, 


sing el — e 2 


cosz i(elz + ei)" (6.28) 


Hyperbolic Functions The hyperbolic sine and cosine functions of real variables 
are defined as 


sinh x = <—*— (6.29) 
teee= a (6.30) 


The complex hyperbolic functions are defined by following the same rules using the 
complex exponential function: 


fpr 
sinhz = + (6.31) 
Rapes 
coshz = —. (6.32) 


The Logarithmic Function The definition for the logarithm of a complex variable 
comes directly from its polar representation: 


Inz = In(re'®) 
= Inr + In(e"*) 
=Inr + i6. (6.33) 


6.3 DERIVATIVES OF COMPLEX FUNCTIONS 


The derivative of a complex function is defined in the same way as it is for a function 
of a real variable: 


dw _ 4, WE + AD — wo 


6.34 
dz Az—0 Az ( ) 


This statement is not as simple as it appears, however, because Az = Ax + iAy, 
and consequently Az can be made up of any combination of Ax and iAy. That is 
to say, the point z can be approached from any direction in the complex z-plane. 
It is not obvious, and in fact not always the case, that the derivative, as defined in 
Equation 6.34, will have the same values for different choices of Az. 

Functions that have a unique derivative in a finite region are said to be analytic 
in that region. As will be shown, the Cauchy Integral Theorem, the Cauchy Integral 
Formula, and, in fact, the whole theory of complex integration are based on the 
analytic properties of complex functions. 
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6.3.1 The Cauchy-Riemann Conditions 


A necessary condition for the uniqueness of the derivative of w(z) is that we obtain 
the same value for it whether Az is made up of purely a Ax or purely an iAy. If 
Az = Ax, Equation 6.34 becomes 


GWG) Wee ae Ey) = eon) 


dz Ax—0 Ax oo 
On the other hand, if Az = iAy, the derivative is 
+iy +idy)—- +i 
AWAZ) gi WO * ty + Ay) — wer + ty) (6.36) 


dz Ay-0 iAy 


If w(z) is broken up into real and imaginary parts, w(z) = u(x, y) + iv(x, y), Equa- 
tions 6.35 and 6.36 become 


dw(z) = ee u(x + Ax, y) + iv(x + Ax, y) — u(x, y) — iv(x, y) 


dz Ax—-0 Ax 
_ du(x, y) i pew): (6.37) 
ax Ox 
and 
dw(z) _ = u(x,y + Ay) + iv(x,y + Ay) — ux, y) — iv{x, y) 
dz Ay-0 iAy 
_ uy) , Ivey) (6.38) 


oy oy 


For these derivatives to be equal, their real and imaginary parts must be equal. 
This requirement leads to two equations involving the derivatives of the real and 
imaginary parts of w(z): 


ou(x, y) = dv(x, y) (6.39) 
ax ay , 
u(x, y) Renee u(x, y) (6.40) 
oy ox : 


Equations 6.39 and 6.40 are known as the Cauchy-Riemann conditions. They must 
be satisfied, at a point, if the derivative at that point is to be the same for Az = Ax 
or Az = iAy. Therefore, they are necessary conditions for the uniqueness of the 
derivative of a complex function. 


6.3.2 Analytic Functions 


The Cauchy-Riemann conditions are not enough to guarantee the uniqueness of the 
derivative. To obtain a sufficient condition, the function must meet some additional 
requirements. 
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If Az is made up of an arbitrary combination of Ax and iAy, then the derivative 
of w(z) can be written as: 


dw) _ 4, wet Ax + iAy) ~ we) 


6.41 
dz Ax,Ay—0 Ax +iAy ( ) 


A Taylor series expansion of the real and imaginary parts of w(z + Ax + iAy) about 
the point z = x + iy gives: 
dw(z) : (du/ax + idv/dx)Ax + (du/dy + idv/dy)Ay 
a lim - ‘ 
az AxAy—0 Ax+idy 


(6.42) 


For this equation to be valid, the partial derivatives of u(x, y) and u(x, y) must exist 
and be continuous in a neighborhood around the point z. A neighborhood is a region of 
arbitrarily small, but nonzero size surrounding a point in the complex plane. Dividing 
the numerator and denominator of the RHS by Ax before taking the limit gives 


dw(z) _ i (du/ax + idv/dx) + (du/dy + idv/dy)(Ay/Ax) 
dz eran 1+ iAy/Ax ; 


(6.43) 


If the derivative is to be independent of how the limit is taken with respect to Ax and 
Ay, the RHS of Equation 6.43 must be independent of Ax and Ay. If the Cauchy- 
Riemann conditions are valid not just at the point z, but in the small region Ax Ay 
about z, i.e., in the neighborhood of z, they can be substituted into the RHS of 
Equation 6.43 to give 


dw(z) a3. Aes (ou/dx + idv/dx) + (—dv/dx + idu/dx)(Ay/Ax) 
r4 


d Tere 1+ iAy/Ax 
: 1+ iAy/Ax 
= | + id —_—__—— 44 
Pe as : w/o + ae Ge) 
ou du 
=— +1I—, 
Ox Ox 
which is independent of Az. 


Consequently, if the partial derivatives of u(x, y) and v(x, y) exist, are continuous, 
and obey the Cauchy-Riemann conditions in a neighborhood around a point z, the 
derivative of the function at that point is guaranteed to be unique. Functions that 
obey all these conditions are called analytic. A function may be analytic in the entire 
z-plane or in isolated regions. But because the Cauchy-Riemann conditions must be 
satisfied in a neighborhood of the complex plane, a function cannot be analytic at an 
isolated point or along a line. The sum, product, quotient (as long as the denominator 
is not zero) and composition of two functions that are analytic in a region are analytic 
in that same region. Proofs of these statements are left as an exercise for the reader. 

There are many useful complex variable techniques which require the use of ana- 
lytic functions. In fact, the rest of the material in this chapter concerns the properties 
of analytic functions. 
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Example 6.1 _ First consider the function w = 2” with 


u(x,y) = xr - y? (6.45) 
v(x, y) = 2xy. (6.46) 
Taking the partial derivatives, 
ou(x, y) Bas Ov(x, y) (6.47) 
Ox oy 
and 
ou(x, y) a du(x, y) (6.48) 
oy Ox 


This function satisfies the Cauchy-Riemann conditions everywhere in the finite com- 
plex z-plane. From either Equation 6.37 or 6.38, 


dw _ odu(x,y) - ,¥@y) _ du(x,y) ; ux ¥) 
dz Ox Ox oy oy 


= 2x + i2y (6.49) 
= 22. 


Notice that this derivative could have been obtained by treating z in the same manner 
as a real, differential variable, i.e., 


= SS" = 2z, (6.50) 


Example 6.2 As another example, consider the function w = 1 /z. In this case, the 
real and imaginary parts of w are obtained by using z*, 


1 1 2 x—iy 
ee zg eye on 
so that 
as 6.52 
WW = aI (6.52) 
ulx,y) = =>. (6.53) 
x“ +y 


A check of the partial derivatives shows that the Cauchy-Riemann conditions are 
satisfied everywhere, except atz = 0, where the partial derivatives blow up. Therefore, 
1/z is analytic everywhere except at z = 0, and its derivative is 


d(1/z) _ 
dz 2 


(6.54) 


144 INTRODUCTION TO COMPLEX VARIABLES 


Example 6.3 As a final example, consider the function w = zz", so that 
w = (x t+ iy)(x — iy) = x? + y’. (6.55) 


The real and imaginary parts are 


u(x,y) = x° + y* (6.56) 
v(x, y) = 0 (6.57) 
so that 
u(x, y) _ x du(x,y) _ 0 (6.58) 
Ox oy 
and 
HOY) oy — Y 9, (6.59) 


The Cauchy-Riemann conditions are not satisfied, except at the origin. Because this 
a single, isolated point, the function w = zz" is not an analytic function anywhere. 
This is made more obvious when the derivative of this function is formed. Using 
Equation 6.37, the derivative is 


1G) a az 47', (6.60) 
dz mee 
while using Equation 6.38, 
d * 
Gs) 2 pyar es (6.61) 
dz Tor 


6.3.3 Derivative Formulae 


The rules you have learned for real derivatives hold for the derivatives of complex 
analytic functions. The standard chain, product, and quotient rules still apply. All the 
elementary functions have the same derivative formulae. For example: 


ds said ; 
— [e!2] = ie! — [cos(z)] = —ie'® sin z. (6.62) 
dz dz 3 = 


6.4 THE CAUCHY INTEGRAL THEOREM 


An important consequence of using analytic functions is the Cauchy Integral Theo- 
rem. Let w(z) be a complex function which is analytic in some region of the complex 


THE CAUCHY INTEGRAL THEOREM 145 


imag 


z-plane 


real 


Figure 6.5 The Closed Cauchy Integral Path 


plane. Consider an integral 


$ dzw() (6.63) 
Cc 


along a closed contour C which lies entirely within the analytic region, as shown in 
Figure 6.5. Because w(z) = u(x, y) + iv(x, y) and dz = dx + idy, Equation 6.63 can 
be rewritten as 


f deme) = f (dx + idy) tuls.y) + i069) (6.64) 
Cc (ss 
= $ [dx u(x, y) — dy v(x, y)] +i ¢ [dx u(x, y) + dy u(x, y)]. 
€ 
Stokes’s theorem 


far-v= fao-VxV (6.65) 
Cc 


s 


can now be applied to both integrals on the RHS of Equation 6.64. For a closed line 
integral confined to the xy-plane, Equation 6.65 becomes 


av, ov 
dxV,+dyV,= | d —- |. 6.66 
fax yVy [aa (% 2 (6.66) 


The first integral on the RHS of Equation 6.64 is handled by identifying V, = u(x, y), 
V, = —v(x, y), and S as the surface enclosed by C, so 


$ [dx u(x, y) — dy v(x, y)] = ~ if deay (z " *) (6.67) 
( s Ox oy 
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The second integral of Equation 6.64 is handled in a similar manner, by identifying 
V, = u(x, y) and Vy = u(, y), so 


ju 8 
[dx v(x, y) + dy u(x, y)] = [ dx dy (2 ~ =), (6.68) 


Because w(z) is analytic and obeys the Cauchy-Riemann conditions inside the entire 
closed contour, du/dx = dv/dy and du/dy = —dv/dx, and we are left with the 
elegant result that 


¢ dzw(z) = 0. . (6.69) 
Cc 


6.5 CONTOUR DEFORMATION 


There is an immediate, practical consequence of the Cauchy Integral Theorem. The 
contour of an complex integral can be arbitrarily deformed through an analytic region 
without changing the value of the integral. 

Consider the integral 


pe i dzw(z) (6.70) 


along the contour C, from a to b, drawn as the solid line in Figure 6.6(a). If we add 
to this an integral with the same integrand, but around the closed contour C, shown 
as the dotted line in Figure 6.6(a), the result will still be J, as long as the integrand is 
analytic in the entire region inside C: 


I =[ dzw(z) + f deme (6.71) 
‘ Cc 


imag 


Teal 


o- () 


Figure 6.6 Contour Deformation 
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But notice the part of C that lies along the original contour has an equal, but opposite 
contribution to the value, and the contributions from the two segments cancel, leaving 
the contour shown in Figure 6.6(b). Therefore, 


cS | dzwZ) = i dzw(z). (6.72) 
0 Cc 


Deformations of this kind will be very handy in the discussions that follow. 


6.6 THE CAUCHY INTEGRAL FORMULA 


The Cauchy Integral Formula is a natural extension of the Cauchy Integral Theorem 
discussed previously. Consider the integral 


Reg Gel. (6.73) 


qQ 27% 


where f(z) is analytic everywhere in the z-plane, and C, is a closed contour that 
does not include the point Zz,» as depicted in Figure 6.7. It has already been shown 
that the function 1/z is analytic everywhere except at z = 0. Therefore the function 
1f(z- Z,) is analytic everywhere, except at z = z,- Because C, does not enclose the 
point z = z,, the integrand in Equation 6.73 is analytic everywhere inside C,, and by 
Cauchy’s Integral Theorem, 


z 
i= é pericy = 0. (6.74) 
a £7-% 
Now consider a second integral, 
Zz 
hh =4 dz 2? (6.75) 
OQ £7 & 


similar to the first, except now the contour C2 encloses z,, as shown in Figure 6.8. The 
integrand in Equation 6.75 is not analytic inside C), so we cannot invoke the Cauchy 


imag z-plane 
Z® 


real 


Figure 6.7 The Contour C, for the Cauchy Integral Formula 
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imag z-plane 


teal 


Figure 6.8 The Contour C, for the Cauchy Integral Formula 


Integral Theorem to argue that J; = 0. However, the integrand is analytic everywhere, 
except at the point z = z,, so we can deform the contour into a infinitesimal circle of 
radius €, centered at z,, without affecting the value: 


h=lim ~ dz=——. (6.76) 


This deformation is depicted in Figure 6.9. 

The integral expression of Equation 6.76 can be visualized most easily with the use 
of phasors, which are essentially vectors in the complex plane. Consider Figure 6.10. 
In Equation 6.76, z lies on the contour C, and is represented by a phasor from the 
origin to that point. The point z, is represented by another phasor from the origin 
to z,. In the same way, the quantity z — z, is represented by a phasor that goes from 
z, to z. If the angle ¢ is defined as shown, using polar notation we have 


z—z, = ee’?. (6.77) 
On Cp, € is a constant so 
dz = iee'*dd, (6.78) 
: imag 
c z-plane 
real 


Figure 6.9 The Equivalent Circular Contour for the Cauchy Integral Formula 
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F Z-Z 
imag ~~ 


z-plane 


real 


Figure 6.10 Phasors for the Cauchy Formula Integration 


and @ goes from 0 to 27. The integral in Equation 6.76 becomes 


z, tee?) 
h = lim ; "age i ef (6.79) 


Ase — 0, f(z, + ee'?) goes to f(Z,) and can be taken outside the integral: 


2 
bh =if(z,) | dd = 27 f(z,). (6.80) 
0 
Our final result is 
L®@ , 2m f(Z,) z inside C : (6.81) 
cin ae 0 z, outside C 


where C is any closed, counterclockwise path that encloses z,, and f(z) is analytic 
inside C. 

Equation 6.81 is the simplest form of Cauchy’s Integral Formula. A more general 
result can be obtained by considering the point z, to be a variable. If we restrict z, to 
always lie inside the contour C, then Equation 6.81 can be differentiated with respect 
to z, to give 

d f(Z,) 1 £@ 


= —- dz 6.82 
dz, Imi Je *@—ze ween) 


In fact, Equation 6.81 can be differentiated any number of times, to give the most 
general form of Cauchy’s Integral Formula: 


d"f(z,) nt £@ 
de Omi Io *@— zy oo) 


Here again, C is any closed, counterclockwise contour, z, is any point inside C, and 
£@) is analytic everywhere inside C. 
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Equation 6.83 says something quite important about the nature of analytic func- 
tions. If the value of an analytic function is known everywhere on some closed contour 
C, the value of that function and all its derivatives are uniquely specified everywhere 
inside C. 


6.7 TAYLOR AND LAURENT SERIES 


Just as with real functions, complex functions can be expanded using Taylor series. 
An extension of the standard Taylor series, called the Laurent series, is also frequently 
used to expand complex functions. Series expansions play a crucial role in the theory 
of complex functions, because they provide the fundamental tools for deriving and 
using the theory of residues, described in the next section. 


6.7.1 Convergence 


Before discussing Taylor and Laurent expansions for complex functions, it is appro- 
priate to say a few words about the convergence of complex series. We assume you 
are already familiar with the convergence properties of real series, so we will not 
attempt to cover that topic with any sort of mathematical rigor. 

An infinite summation of real numbers x, is said to converge to the value S if 


oo 


S = 5° x, > A Finite Real Number. (6.84) 
n=0 


The series in Equation 6.84, is said to converge absolutely if 


> lal > A Finite Real Number. (6.85) 
n=0 


Because 


S> |xnl = bie (6.86) 
n=0 n=0 


absolute convergence is a more rigid requirement than convergence. That is, if a series 
converges absolutely it will always converge. However, a series may converge even 
though it does not converge absolutely. For example, the alternating harmonic series 


1 1 


Bh ol 
oat ad a ne 6. 
: 2 3 4 5 6 By 
converges to In 2, but the harmonic series 
1 #1, 1,121,141 
+ +o-+i+s4+-:- 6. 
neo go aos, 6 ee) 


diverges. 
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The ratio test can be used to establish the absolute convergence of a series. That 
is, if the terms of an infinite series satisfy 


Xn+1 
Xn 


lim <1 


no 


, (6.89) 


the series converges absolutely. Again, a series may converge even though it does not 
satisfy the ratio test. There are plenty of other tests for convergence, but the ratio test 
is the only one we need for our discussions. 

Things are not too different when we consider infinite series of complex numbers. 
A summation of complex numbers z, converges to the complex value S$ if 


S = 5°z, > A Finite Complex Number. (6.90) 


n=0 


For convergence of the infinite complex series, both the real and imaginary parts must 
converge. That is, if we write 2, = x, + iy, and S = S, + iS;, we must have 


S, = S- x, —> A Finite Real Number 
nee (6.91) 


5; = }> yn > A Finite Real Number. 
n=0 


The complex series defined in Equation 6.90 converges absolutely if 


», | z,| — A Finite Complex Number. (6.92) 
n=0 
Because 
lent = Do mn (6.93) 
n=0 n=0 
and 


dle] = Dye (6.94) 
n=0 n=0 


if the series converges absolutely, the two series in Equations 6.91 will both converge, 
and consequently the original complex series in Equation 6.90 must also converge. 
The absolute convergence of a complex series can also be established by the ratio 
test. That is, if the terms of the series satisfy 


Sn+1 


4 


&n 


lim <j 


n—00 


, (6.95) 


the series will converge absolutely. 
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6.7.2 Taylor and Laurent Series of Real Functions 


Before dealing with expansions for complex functions, Taylor and Laurent series for 
real functions will be reviewed. Recall that the Taylor series expansion of f(x), a real 
function of the variable x, around the point x, is 


f(x) = Par: — x5)" = Cy +e (X — Xo) + (xX — XP tee-. (6.96) 
n=0 


The coefficients c,, can be obtained from the expression 


_ ld’f@) 


“mnt dx” |x=x, 


(6.97) 


and depend upon both the function f(x) and the expansion point x,. The region of 
convergence of this series can be determined by using the ratio test. The series on the 
RHS of Equation 6.96 converges if 


Cr(X ~ Xo)” 


st  ] <1. 98 
Cn—1(X = Xo)! : oF ) 


lim 


n—-o 


In general a Taylor series will converge for values of x in a symmetric range around 
Xo, given by |x — x,| < A, where A depends upon both the expansion point and the 
function f(x). 

A Laurent series expansion differs from a Taylor series because of the additional 
terms with negative n values. An expansion of f(x) around x, using a Laurent series 
has the form 


£0) = SO o(a - x0)" (6.99) 


C2 C-} 
(x ~ Xo)? (x — Xe) 


+ Cy + c1(% — Xp) + E2(x — XY? tooo. 


Convergence for a Laurent series is slightly more complicated to verify than for 
a Taylor series, because we must check the convergence of both the positive and 
negative terms. In many cases, the negative terms of the series terminate at some 
finite value of n, and we only need to worry about the convergence of the positive 
terms. For the general case, however, we can perform two ratio tests, one for the 
positive terms and one for the negative terms. Clearly, if there are any terms with 
negative n values, the series cannot converge at x = x,. In general, a Laurent series 
will converge in a symmetric range of x given by Amin < |x — Xol < Amax, where 
Amin and A,,,, are the minimum and maximum allowed distances from the expansion 
point. 
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Figure 6.11. Polynomial Long Division 


6.7.3 A Basic Complex Taylor Expansion 


As part of the derivation of complex Taylor and Laurent series, the following basic 
complex Taylor expansion will prove to be valuable. This expansion, 


s1ltzgt+Z4+2t--= oz, (6.100) 


n=0 


— 


IN 


converges absolutely for all |z| < 1, a fact easily shown by applying Equation 6.95, 
the ratio test for complex series. This result derives from the m — © limit of the 
identity 

ltzt+2teotee tet = —s— ‘ (6.101) 


which can be proved by multiplying both sides of this expression by (1 — z), or by 
the long division shown in Figure 6.11. 
6.8 THE COMPLEX TAYLOR SERIES 


Similar to a real Taylor series, a complex Taylor series attempts to expand a complex 
function around a point, using a sum of complex terms: 


{O= eGo y Set ec az yteee ey to (6.102) 
n=0 
The c,, coefficients and the region of convergence of the series depend on the func- 


tion f(z) and the expansion point z,. We will show how these coefficients and the 
region of convergence can be obtained in two ways, first by using Cauchy integra- 
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tion techniques, and then by applying the basic complex Taylor series developed in 
Section 6.7.3. 


6.8.1 The Taylor Coefficients from Cauchy Integration 


The process of determining the coefficients from a Cauchy integration begins by 
selecting an expansion point z, and a counterclockwise, circular contour C centered 
around the expansion point, as shown in Figure 6.12. The function f(z) and its analytic 
properties are presumed to be known. Assuming f(z) is analytic everywhere inside C, 
Cauchy’s Integral Formula (Equation 6.81) states that 


1 fe) 


Se ans. / 
I@ = 75 uae ae 


, (6.103) 


where z is any point inside C, and z’ is the complex variable of integration that lies 
along C. z, can be inserted into the denominator of the integrand and manipulated as 
follows: 
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(6.104) 
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Because |z — z,| < |z' — z,| for all z inside C, the basic complex Taylor series of 
Equation 6.100 implies that 


= * -o(E —) (6.105) 
S40 z! 


= a 
2%, n=0 
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Figure 6.12 Contour for Taylor Series Coefficients 
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which converges for all z inside C. Substitution of Equation 6.105 into 6.104 gives 


_ i ,| £@) G(z-z\" 
fo = 55 p dz fe » (2). (6.106) 


—o n=0 


We will now interchange the order of integration and summation in Equation 6. 106. 
It is not obvious that this is mathematically allowed, because we are manipulating 
an infinite series. An important theorem, however, says that the integration and 
summation of a series, or for that matter the differentiation and summation of a 
series, can be interchanged if the series in question converges uniformly. Uniform 
convergence is a property of infinite series of functions of one or more independent 
variables. We will not cover this topic in this text, but we refer you to almost any book 
on advanced calculus. See, for instance, Kaplan, Advanced Calculus or Sokolnikoff 
and Redheffer, Mathematics of Physics and Modern Engineering. The interchange 
of the integration and summation gives 


Pe 


1 (z') 
L@ = s Ee fai! ot =| G2)" (6.107) 


n=0 


Comparing Equation 6.107 with Equation 6.102, the general form of the Taylor 
series for f(z), identifies the coefficients as 


f@’) 
c, = wa fe i Gane (6.108) 


Using Equation 6.83, the general form of Cauchy’s Integral Formula, the coefficients 
can be recognized as 


: (6.109) 


which is a satisfying extension of Equation 6.97 for the coefficients of real Taylor 
series. These coefficients will generate a complex Taylor series that will converge to 
f(z) for all z inside C. The convergence properties of the series are best demonstrated 
by the example that follows. 


Example 6.4 As an example of this process, let’s obtain the coefficients of the 
Taylor series for the complex function 


{@ = (6.110) 


Z—a 

This function is analytic everywhere in the complex plane, except at z = a. The 
Taylor series is to be expanded about z = z,. These points are shown in Figure 6.13. 
The circular path for the Cauchy integration is also shown in this figure. It is centered 
at z, and has a radius such that | /(z ~ @) is analytic everywhere inside. In other 
words, if z’ is a point along the contour, |z' — z,| < a — z,|. The coefficients for the 
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Figure 6.13 The Complex Plane for Determining the Taylor Coefficients for 1/ Z-@ 


Taylor series can now be obtained from Equation 6.109 as 


fn n! dz 
: foe (6.111) 
1 n 
- (3 = =) 
so that 
1 oo 1 nt+1 
= yo (-z)". (6.112) 
z-@ 2, (3 r= 2) a 


This series converges in the crosshatched region of Figure 6.13 where |z — z,| < 
lz’ —z,|. In that same figure, we have drawn a dashed circle, centered at z, and passing 
through the singularity at a. This circle, which we call a singularity circle, is the largest 
radius that the contour C can have, while still keeping f(z) analytic everywhere inside. 
Consequently, the region of convergence for the series in Equation 6.111 can be made 
to extend up to the singularity circle, as shown in Figure 6.14. Ifa value of z is chosen 
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Figure 6.14 Region of Convergence 
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Figure 6.15 The z, Dependence of the Convergence Region 


that lies inside this singularity circle, and is inserted into the LHS of Equation 6.112, 
we will obtain the same finite value that we get from inserting this same z into the 
RHS. On the other hand, if we select a point outside of the singularity circle, and 
use it in the same equation, the LHS will give a finite number, while the RHS will 
diverge. 

If we have amore complicated function, with more than one nonanalytic point, the 
region of convergence is inside the smallest of all the singularity circles. Also, notice 
that the Taylor coefficients and the region of convergence depend on the choice of z,. 
By selecting different z,’s, different Taylor series can be generated. Figure 6.15 shows 
two different expansion points and their corresponding regions of convergence. In 
this way, a Taylor series can be generated for this function that is valid for any z, 
except z = a. From this picture it can also be seen that no single Taylor series can 
be made to converge in a region that surrounds a singularity point. This will be an 
important point in our discussion of Laurent series. 


6.8.2 The Taylor Coefficients Using the Basic Taylor Series 


Many times it is possible to determine the Taylor series coefficients and the region of 
convergence without using the technique described above, but instead by manipulat- 
ing f(z) to be in the form of the basic Taylor series given in Equation 6.100. This is 
best demonstrated by the following examples. 


Example 6.5 Again consider the function 


f@=—, (6.113) 


za 


where a is a complex constant. Dividing the numerator and denominator by —a gives 


-l/a 
1-z/a 


f@® = (6.114) 
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As long as |z/a| < 1, the denominator can be expanded using Equation 6.100 to give 


(6.115) 
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But Equation 6.115 is already in the form of a complex Taylor series, with an 
expansion point around the origin: 


ar yar (6.116) 


nt+1 
c,=- (;) (6.117) 
a 


Convergence of this series can be checked by a ratio test 


with 


+1 
tim [S412] < 1, (6.118) 
no pz” 
which gives the condition 
\z| < jal, (6.119) 


which you might have guessed from the condition on the series expansion used in 
Equation 6.115. 

Notice the convergence region is a circle centered at the expansion point with a 
radius of |a|. This confirms the previous discussion which led to Figure 6.14. 


Example 6.6 Now let’s seek a Taylor series for the same function, but this time 
expanded about an arbitrary point z = z,. The series must take the form 


1 +00 
= Slee ~ z,)". (6.120) 
Z-a a 
= ena n=0 
If we manipulate the function as follows: 
1 1 ( -1 ) 1 
= = ———= ]. (6.121) 
ga (e=2,)~a-— Zz) a-Z, (=) 
we can use the same technique as before, and write 
_ ex ae 2 
L =( L\ i+ es Boe, (6.122) 
zZ—-a a-%, a-z, (a@-Z,) 


with the region of convergence 


fon _ 2g (6.123) 
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which is the same region depicted in Figure 6.14. Thus we find the coefficients for 


this expansion are 
nt+l1 
C= : 6.124 
Cn = - (6.124) 


Once again, the series converges in a circular region centered at z,. The conver- 
gence radius is the distance from the expansion point to the nonanalytic point a. 
Notice that these are the same coefficients and region of convergence we obtained 
using the Cauchy integration approach. 


6.9 THE COMPLEX LAURENT SERIES 


A complex Laurent series has the form 


oo) 


I= » (oA Cea oe et (6.125) 


n=—o% 


and is distinguished from a Taylor series because it contains negative n terms. As 
with the Taylor series, the coefficients and the region of convergence depend upon 
the function f(z) and the expansion point z,. These coefficients and the region of 
convergence for the Laurent series will be obtained from a Cauchy integral approach, 
and also by using the basic complex Taylor series developed in Equation 6.100. 
Laurent series are useful when an expansion is desired that converges in a region 
surrounding a nonanalytic point of f(z). 


6.9.1 The Laurent Coefficients from Cauchy Integration 


Imagine we wanted to generate a series expansion for the function 1 /(z— a) about z,, 
as we did in the previous section, but this time, we want an expansion which is valid 
in a region outside of the singularity circle of a. We cannot use the contour shown 
in Figure 6.16, or any such contour which surrounds the singularity, and expect to 
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Figure 6.16 Invalid Contour for Expanding for 1/(z — a) around z, 
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Figure 6.17 Contour for Calculating Laurent Coefficients 


generate a Taylor series using Equation 6.108, because that equation requires that 
f(z) be analytic at all points inside C. 

~ Instead, consider the contour shown in Figure 6.17. In the crosshatched region 
interior to C, the function f(z) is analytic. Therefore, according to Cauchy’s Integral 
Formula, 


fo= sof af 22 
= 2m Jo ~ 2'-z 


Using contour deformation, the two straight-end pieces of C can be brought arbitrarily 
close together, so their contributions to the integral cancel. Therefore, we can rewrite 
this integral as 


(6.126) 


I } 
{@= maf. gpl NT pte (6.127) 
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where C; and C, are shown in Figure 6.18. Notice the sneaky introduction of a minus 
sign into the contour C2, so both contours are now counterclockwise. Again, we 
require C, and C, to be circular, so we can use a similar expansion to the one used 
with the Taylor series. 
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Figure 6.18 Double Circular Contour for Laurent Coefficient Integration 
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Next manipulate the integrals of Equation 6.127. First insert z, 


1 ; f(z’) 
£@ = of ec 2) (sz) 
1 f(z’) 
Ee / == 6.12 
im f,? et tel eee a 


and then generate quantities that can be expanded using the basic Taylor series of 
Equation 6.100: 


1 f(z') 
fo = 55 $ dz ie 
we 2m Se @— 2) [1 - £2 


l (z') 
ai dz’ f ‘ z'-z, | 


(6.129) 


We have manipulated the two denominators differently in anticipation of the series 
expansions about to be performed. On C, |z — z,| < lz’ — z,|, and we use the 
expansion 


fo) 


7 £7 %, 
= De a (6.130) 


Zaz 


On C2, |z — z,| > |z' — z,|, so a different series is used: 


ogee Ve (6.131) 


— 
When these expansions are substituted into Equation 6.129, and the orders of the 
integrations and summations are reversed, the result is 


aitial ; ‘ 
ii = d E f dz GZ! 7 | (-z,)" 


+ ye E f dz! f(z'\(z' — z,) | @-zyt (6.132) 


m=0 


Equation 6.132 is not quite in the form we are seeking. The first summation, involving 
the integrals over Cj, is fine and can be identified with the n = 0 terms of the Laurent 
series. In the second summation, m goes from 0 to +. If the substitution m = ~—n—1 
is made, this summation can be identified with the n < 0 terms of a Laurent series. 
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Equation 6.132 can be rewritten as 
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n=0 
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Comparing Equation 6.133 with Equation 6.125, the general expression for the 


Laurent series, the c, for n = 0 are given by integrals over C; 
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while the c, for n < 0 are given by integrals over C2 


i f@’) 
ae z dz Gayl forn <0. (6.135) 


The Laurent series that is generated by these coefficients will be valid in the 
annular, crosshatched region of Figure 6.18. This region can be expanded by shrinking 
C, and expanding C2, while keeping f(z) analytic between the two circles. Therefore 
C> can be shrunk to the singularity circle passing through z = a, and C, can be 
expanded until it runs into another singularity of f(z). If b is the first such singularity, 
the convergence region would be as shown in Figure 6.19. If f(z) has no other 
singularities, C, can be expanded to infinity. ~ 

Once we have determined the maximum contour C, and minimum contour C;, 
as described above, the coefficient expressions in Equations 6.134 and 6.135 can be 
combined into a single expression, 
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Figure 6.19 Convergence Region for the Laurent Series Expansion Between Two Nonana- 
lytic Points 
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Figure 6.20 Complex Plane for Determining the Laurent Coefficients 


where C is any contour which lies entirely within the two expanded contours. This 
last simplification is a direct consequence of the deformation theorem for analytic 
regions. 

For n < 0, the only nonanalytic points of the integrand in Equation 6.136 are 
those associated with f(z). For n = 0 the integrand has singularities associated with 
f(Z) as well as those generated by the (z’ — z,)"*! factor in the denominator. Notice 
that if f (z) is analytic everywhere inside C, all the c, for n < O must be zero, and the 
Laurent series degenerates to a Taylor series, as expected. 


Example 6.7 As an example of this process, we will determine the coefficients of 
the Laurent series expansion for the complex function 


1 2z-a-—b 

HE 2st C= Oe= By (6.137) 
This function is analytic everywhere in the complex plane, except at z = aandz = b 
The Laurent series is to be expanded around the point z = z,. These points are shown 
in Figure 6.20, where we have assumed that a is closer to z, than b. For this example, 
the contour C used for determining the Laurent coefficients from Equation 6.136 must 
be centered at z,, but can be located in three general regions: inside the singularity 
circle of a, between the a and b singularity circles, or outside the b singularity circle. 
These three situations will be treated separately. 


[@ = 


The Contour C Inside the Small Singularity Circle The case of the C contour 
lying inside the singularity circle of a is shown in Figure 6.21(a).The coefficients of 
the Laurent series are obtained from the Cauchy integration of Equation 6.136 as 


2z'-a-—b 
= oi $42 ~ (2! — al(z' ~ bz! -— z (2! — az! ~ bY! — z, (6.138) 


For n < 0, the integrand of Equation 6.138 is analytic inside C and so for n < 0 all 
the c, are zero. For n = 0, the only nonanalytic point of the integrand is at z,, and so 
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(a) 
Figure 6.21 Complex Plane for C Inside the Small Singularity Circle 
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1 n+1 1 nt+1 
-(+) -() n=0. (6.139) 


Therefore, for this case, the series expansion becomes 


Died diss ( 1 y+)" =z)". 6.140) 
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The region of convergence for this series can be determined by finding the largest 
and smallest circles around the point z, that have f(z) analytic everywhere in between. 
In this case, the largest circle we can have lies just inside the singularity circle that 
passes through a. Because there are no other nonanalytic points inside this circle, the 
smallest circle can be collapsed to a circle of radius zero. Thus the series converges 
for the entire crosshatched region shown in Figure 6.21(b). 

In this case, it can be seen that the series in Equation 6.140 is not a Laurent series 
at all, but just a simple Taylor series with the standard Taylor series convergence 
region. This will always be the case if f(z) is analytic inside C. 


The Contour C Between the Singularity Circles Now let’s look at the more 
interesting situation where the contour C lies between the singularity circles, as 
shown in Figure 6.22(a). The coefficients for the Laurent series can be obtained by 
Cauchy integration using an expression identical to Equation 6.138, except C is now 
between the singularity circles 


2z'-a-b 


—— / 
cae QT gaz (2! — ay(z! — byz' — zytr (6.141) 
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(a) 
Figure 6.22 Complex Plane for C Between the Singularity Circles 


For n < 0, the integrand of Equation 6.141 is no longer analytic inside C because of 
the singularity at z = a. Therefore, 


1 n+l 
C, = ( ) forn < 0. (6.142) 


=n 
a> %, 


For n = 0, there are two singularities inside C, one at z, and another at a. Therefore, 
after some work, we obtain 


1 n+l 
Cn = ¢ ) forn = 0. (6.143) 


27 %, 


For this case, the series expansion becomes 


2z-a-b _ = 1 n+1 . 
Pa ee) Se 


=—o00 


ee 1 n+l 
+>-(5) ( — z,)". (6.144) 


The region of convergence for this series is obtained by determining the largest 
and smallest circles, centered around z,, that still have A (z) analytic everywhere in be- 
tween. In this case, the largest circle is just inside the singularity circle passing through 
b, while the smallest is the circle that lies just outside the singularity circle passing 
through a. The resulting annular convergence region is shown in Figure 6.22(b). 

In this case, we have a true Laurent series with n < 0 terms in the expansion. 
Notice how the terms for n = O make up the Taylor series expansion for the 1/(z — b) 
part of f(z), while the n < 0 terms are a Laruent expansion for the 1/(z — a) part. 
The n < 0 terms diverge inside the singularity circle that passes through a, while 
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the n = 0 terms diverge outside the singularity circle that passes through b. The 
combination only converges between the two singularity circles. 


The Contour C Outside the Large Singularity Circle As a final variation on 
this expansion, consider the situation where the contour C lies outside the large 
singularity circle that passes through b, as shown in Figure 6.23(a). The coefficients 
for the expansion can be obtained by a Cauchy integration using an expression 


identical to Equation 6.138, except C is now outside the singularity circle that passes 
through b: 


1 2z/-a-b 
c, = == dz’ =____— ‘ 6.145 
“a Qn $ 4 (z! — az! — by(z' — z,)"*! ( : 
For n < 0, the integrand of Equation 6.145 has two singularities, one at z = a and 


the other at z = b, and we have 


1 n+] 1 nt+{ 
Cc, = ( ) + ( ) for n < 0. (6.146) 
a-Z, b-zZ, 


For n = 0, there are three singularities inside C, at z = a, z = b, and z = z,. After 
more work than last time, we find the negative coefficients are all zero: 


c, =0 for n = 0. (6.147) 
The series expansion, therefore, becomes 


2z-a~b = 1 \t 1 \ntl : 
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As before, the region of convergence for this series is obtained by finding the largest 
and smallest circles centered at z, that keep f(z) analytic between them. In this case 
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Figure 6.23 The Complex Plane for C Lying Outside the Large Singularity Circle 
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the smallest circle lies just outside the singularity circle that passes through z = b. 
The largest circle can be extended to infinity because there are no other nonanalytic 
points of f(z). Therefore, the convergence region is as shown in Figure 6.23(b). 

In this case we again obtain a true Laurent series. The coefficients of the series 
contain two terms. The first generates a Laurent series for 1/(z — a) and the second 
a Laurent series for 1/ (z — b). The 1/ (z — a) part of the series converges outside the 
singularity circle that passes through a. The 1 /(z — 9) part of the series converges out- 
side the singularity circle that passes through b. The combination converges outside 
the largest of the two singularity circles, the one passing through b. 


6.9.2 Laurent Coefficients Using the Basic Taylor Series 


Evaluating series coefficients from Equation 6.136 is not always an easy task. For- 
tunately, like the Taylor series, Laurent series often can be generated from the basic 
series expansion of Equation 6.100. This is best demonstrated by an example. 


Example 6.8 Consider the function 


fO= (6.149) 


which is analytic at every point except z = a. We seek a Laurent series for this 
function expanded about z, = 0 in the form 


1 
= Vi ez". (6.150) 
We can manipulate Equation 6.149 using Equation 6.100 to obtain 


Ole OPE] 


This is in the form of a Laurent series with only n < 0 terms. The coefficients of 
Equation 6.150 are 


_ fo n20 
Cc, = (os peo? (6.152) 


Because the series expansion in Equation 6.150 converges if 
lzl > lal, (6.153) 


this Laurent series is valid in the region shown in Figure 6.24. This is the region 
outside the singularity circle that passes through the point a. There is no outer limit 
to this convergence region. 
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Figure 6.24 Convergence Region for Laurent Series Expansion About z, = 0 


The procedure for an expansion about an arbitrary point z = Z, is similar. Manip- 
ulate Equation 6.149 as 


J 1 
(a) eae) (6.154) 


Expanding the second term gives 


1 i — —z\* 
=( ) [p+ SEB + (8) ve, (6.155) 
at aS ae ee ee Se 


which again is a Laurent series with coefficients 


_ fo n=O 
C, = eee 226: (6.156) 


The series converges if 


(6.157) 


jg- zl |e=<z, 


7 


| imag 


z-plane 


ee 


Ui 


Figure 6.25 Convergence Region for Laurent Series Expanded about z, 
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which is a region outside the singularity circle centered at z, and passing through 
a, as shown in Figure 6.25. Notice, the region of convergence can be modified by 
changing the expansion point z,. As z, gets closer to a, the series becomes valid for 
more and more of the complex plane, but can never be made to converge exactly at 
Z=a4. 


6.9.3 Classification of Singularities 


Laurent expansions allow singularities of functions to be categorized. Imagine the 
point z = a is a singularity of the function f(z). If the most negative term in the 
Laurent series expansion around the singular point is —m, the singularity is called a 
pole of order m. For example, the function 


1 
cea 


LO = (6.158) 


is its own Laurent expansion around z = a, and thus this function has a pole of order 
one at that point. This is often called a simple pole. The function 


f= a (6.159) 


ay 


again is its own Laurent Expansion around z = a and therefore has a pole of order 
three. 

It is possible to have a function that appears to have a pole, but on closer exami- 
nation does not. An example is the function 


f@ = me (6.160) 


This function has a singularity at the point z = 0, because the denominator goes to 
zero, but the Laurent expansion around z = 0 has no negative terms. This type of 
singularity is called removable because the function does not diverge as we approach 
the singularity. This can be seen in the case of Equation 6.160 because 


_ sin(z) 
lim =— = 
z70 Zz 


1, (6.161) 


even though the value exactly at z = 0 is undefined. 

It was quite easy to determine the order of the singularities in Equations 6.158 and 
6.159, because they were in the convenient form of 1/(z — a)”. For a singularity of 
a more complicated function, you might think we would always have to determine 
the coefficients of the Laurent series. Fortunately, there is another approach, which is 
often much quicker. Notice for a pole of order m at z = a, multiplication by the factor 
(z — a)” will get rid of the divergence at a. In other words, if the original function 
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diverges because of a pole of order m at z = a, i.e., 


lim f@ — £%, (6.162) 


the divergence can be removed by multiplying by the factor (z — a)” 


lim [f@(z — a)"] — a finite value. (6.163) 
2a 


Thus, one way to determine the order of a pole is to multiply by successive factors 
of (z — a), until the result no longer diverges at a. The number of multiplying factors 
needed determines the order of the pole. 

A final type of singularity exists if the function diverges at z = a and this diver- 
gence cannot be removed by multiplying by (z — a)” for any value of m. This is called 
an essential singularity. The Laurent expansion for a function about an essential sin- 
gularity point will have an infinite number of negative n terms. The function e \/2 has 
an essential singularity at z = 0. This function goes to infinity at z = O faster than 
any power of z goes to 0. 


Example 6.9 To illustrate this technique, consider the function 


‘4 


— Ss (6.164) 
cosz — 1 


{[@ = 


There are singularities everywhere cosz = 1. Let’s consider just the singularity at 
z=0. 

First we will demonstrate that the function diverges as we approach z = 0. Notice 
that both the numerator and denominator are zero at this point, so we must use 
1’ Hopital’s rule to determine the limiting behavior of the function. The complex 
version of |’ Hopital’s rule is identical to the version for real functions: 


L@)_ [df @/dz 
= 6.165 
Pe 48] - ae ova are 


as long as both 
lim [f,2] = 
lim [f,@] =. (6.166) 


Applying this to the z = 0 singularity of Equation 6.164 gives 


br a = lim (<2) = lim (=) > ~o, (6.167) 
z0 —1 z—0 \ sinz 
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Now let’s determine the order of the pole. Start by multiplying the function by z 
and taking the limit: 


Zz 0 
li = hi = : 6.168 
lim [f@z| = lim (see i) | ir (6.168) 
To determine this limit, use 1’ Hopital’s rule: 
2 
. Zz ; d(2)/dz . —2z 0 
1 = =] = = = —=)j=- 6.169 
0 (ae = i) ae Fes Spf ols) oS 
not once, but twice: 


2 
: z _ [d(—2z)/dz ; =), 

1 —=— ] =] =~_=| = lim = —2., 6.170 
sa (<2 a ;) 30 ano | 20 (=) ( ) 


Multiplying by z has removed the divergence, so the singularity at z = 0 is a pole of 
order one. 


6.10 THE RESIDUE THEOREM 


It could be argued that the most useful result of this chapter is the Residue Theorem. 
This theorem provides a powerful tool for calculating both complex contour integrals 
and real, definite integrals. 


6.10.1 The Residue of a Single Pole 


Consider an integral of w(z) on an arbitrary closed contour in the z-plane: 


l= $ deme (6.171) 
c 


Assume that w(z) has one singularity inside C, and that w(z) can be expanded in 
a Laurent series about a point z,, such that the contour C is entirely within the 
convergence region, as shown in Figure 6.26. Notice that a Laurent series is necessary 
because w(z) is not analytic everywhere inside C. Consequently, the series can be 
substituted for w(z) in the integral 


is $ dz >) 6G =z)! (6.172) 
c 


n=—0 


Reversing the order of summation and integral gives 


iS Pa fp dzaate ~z,)". (6.173) 
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Convergence 
region 


Figure 6.26 Contour and Convergence Annulus for Residue Theorem Integration 


Consider any one of the terms with n = 0. Its contribution to J must be zero 
because the integrand is analytic everywhere: 


$ deez =z)"=0 n=0. (6.174) 
fol 


Now look at the n = —2 term. According to Cauchy’s Integral Formula, 


C2 ac 
dz-——-= =2 
f 2G -—<) dz 


to 


= 0, (6.175) 


so the contribution of this term to J is also zero. This will be the case for all terms of 
the series with n = —2. For n = —1, however, 


I= ¢ dzw(z) -4 £1 gz =2nic_,, (6.176) 
Cc cli %& 


where again, the last step follows from the Cauchy Integral Formula. 
The c_, coefficient is referred to as the residue at the singularity and, in general, 
the integral’s value is 


¢ dzw(z) = 2a X the residue. (6.177) 
Cc 


6.10.2 Residue Theorem for Integrals Surrounding Multiple Poles 


The extension of the above arguments to cover functions with several poles is straight- 
forward. Consider the complex function 


f@ 


; 6.178 
(z — a(z — bz -— ©) ‘ ) 


w(Z) = 
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real 


Figure 6.27 The Residue Theorem Applied to Multipoled Functions 


where f(z) is analytic in the entire z-plane. There are three poles at z = a, z = b, and 
z = c, Imagine we are interested in evaluating an integral 


| ¢ dzw(2), (6.179) 
Cc 


where C encloses two of the poles of w(z), as shown in Figure 6.27. Because the 
Cauchy Integral Theorem allows us to arbitrarily deform the contour through analytic 
regions, the integral can be broken up into the sum of two individual integrals, one 
around each pole: 


L= f dzw() = p dzwz)t+ p dzwi), (6.180) 
Cc Cc C2 


as shown in Figure 6.28. Therefore, the value of the integral will be 277i times the 
sum of the residues of w(z) at a and b: 


g dz f@ = 2mi[residue at a + residue at b] (6.181) 
c (%— az — bz - ©) ue at a + residue at 9]. 


The general form of the Residue Theorem for a function with an arbitrary number 
of poles is 


¢ dzw(z) = 277 S> Residues of poles inside C. (6.182) 
Cc 


imag 


Cc 


e 
Oe 
real 


c 


Figure 6.28 Equivalent Multipole Contours 
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6.10.3 Residue Formulae 


Now that we have shown why residues are important, we need to develop some 
methods for calculating them. We showed earlier that the residue of a singularity is 
just the c_, term in a Laurent series expanded around the singularity itself. Thus, we 
can always determine the residue of a singularity once we know the Laurent series. 
Sometimes this is the only way possible, but in many cases we don’t have to do that 
much work. 

Imagine we already know that a singularity at z = a is a first-order pole. The 
Laurent series for such a pole, expanded about the pole, looks like this, 


w= Co tetee—atee—a@t--, 6.183) 


with no terms for n < —1. Multiplying this series by (z — a) gives 
WZ ~ a) = ¢_, + -a)+¢,(¢~ 9) +a@- a) +: (6.184) 
If then we take the limit as z — a, we find that 


lim[w(2)(z —a)] =c_}. (6.185) 


This gives a very simple method for determining the residue of a first-order pole. As 
an example, consider the function 
sin z 
= —~—>+—., 6.186 
WO = Gag -D acs 
which has first-order poles at both z = a and z = b. Let’s calculate the residue at 
2-4: 


sina 
lim (z)\(z — a)] = lim ape a = ——_, 6.187 
a @-ae-b* @—b) enn 
This method can be extended for the residues of higher-order poles. The Laurent 
expansion of a second-order pole located at z = a, expanded around the pole, has the 
form 


2h 16-9 c-1 - at 
w(Z) = ar + G=a) +e,+e(Z-a)t--. (6.188) 


Multiplication by (z — a)” gives 
w(z\z - a)? =¢ + ¢.,(¢~at+e,(z-a* +: (6.189) 


Now we take the derivative 


< [w(z)(z — a)”] = c_, + 2eg(z — a) + 3e\(z- a)? + -, (6.190) 
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and finally take the limit as z — a: 

_ 4d 2 

lim az (w(z\z — a)”| = c.). (6.191) 


This gives us a simple formula for calculating the residue of a second-order pole. For 
a pole of order n located at z = a, Equation 6.191 generalizes to 


1 ; n—-I P 
f1 = Go im ae [w(z\(z — a)"] . (6.192) 


6.11 DEFINITE INTEGRALS AND CLOSURE 


The residue theorem can be used to evaluate certain types of real, definite integrals 
which appear frequently in physics and engineering problems. The method involves 
converting a real integral into a complex contour integral and then closing the con- 
tour. The residue theorem can then be used to evaluate the contour integral, and 
subsequently the original definite integral. 

Let x be a pure real variable, and consider an integral with the form 


+00 
I -/ dx w(x), (6.193) 


where w(x) is a real function of x. Even though this integral involves quantities that 
are all pure real, it can be evaluated with x represented by a complex variable z by 
performing the integration along the real axis of the complex plane. That is 


T= / dx w(x) = [acm (6.194) 
—w R 


where R is a straight-line path along the real axis from z = —©toz = +, as shown 
in Figure 6.29. To proceed, we need to define the function w(z) for values of z off 
the real axis. We can do this with a process called analytic continuation. Basically, 
this involves the determination of a complex function which, along the real axis, is 
the same as the original real function, and is analytic in as much of the complex 
plane as possible. In general, the continuation of a real function can be performed by 


imag z-plane 
real 


R 


Figure 6.29 Equivalent Path for Real Integral 
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z-plane 


Figure 6.30 Semicircular Contour for Closure 


simply replacing all the x’s in a function with complex z’s. For example, the analytic 
continuation of w(x) = x? would be w(z) = 2. 

Because R is not a closed path, the residue theorem cannot be used to perform the 
complex integration of Equation 6.194. However, suppose 


| dzw(z) = 0, (6.195) 
nN 


where 1 is a semicircular path of infinite radius in the upper half z-plane, as shown in 
Figure 6.30. Because this additional section cannot change the value of the contour 
integral, we can just add it to the original complex integral to form a closed contour. 
That is, if the integral along M is zero, then 


[= [awe = fazma + | dzw{z). (6.196) 
R R n 


The combination of the R and M paths form the closed contour C shown in Figure 6.31. 
The original definite, real integral can now be written as a closed integral that can be 
evaluated by using the residue theorem: 


+06 
/ dx w(x) = $ dzw() 
—oo Cc 


= 2mi ) > Residues of w(z) inside C. (6.197) 
imag 
z-plane 
Teal 


Figure 6.31 Closed Contour for Definite Integral 
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The technique of adding a section of contour to form a closed contour is referred 
to as closure. In the discussion above, closure was accomplished by adding a coun- 
terclockwise, infinite radius semicircular path in the upper half plane. Sometimes 
closure is accomplished by a similar semicircular path in the lower half plane. Clo- 
sure is most often performed with semicircles, but there are cases where rectangles 
or other shapes are necessary. 


6.11.1. Conditions for Closure on a Semicircular Path 


As mentioned, for Equation 6.197 to be valid, the integral along ™ must be zero. 
This integration can be viewed as a semicircular contour, as shown in Figure 6.30, in 
the limit of the radius approaching infinity. If we let r be the radius of the semicircle, 
then on M 


Z= re’? (6.198) 
where 0 < @ < a for a semicircle in the upper complex plane. To move along M, 
dz = ire'®d@, (6.199) 


and the integral on M becomes 
oz . . 
| dzw(z) = lim : 0 ire’ w(re"®). (6.200) 
nN r--¥00 0 


Thus to successfully use closure in the form described in the previous section, w(z) 
must obey 


lim | dO ire’ w(re®) = 0. (6.201) 


re 


Using Equation 6.201 can be a little unwieldy, but a more usable condition can be 
derived by using the inequality 


| faz£0 


which holds for any complex integral. Applying this to Equation 6.201, and noting 
le®| = 1, gives 


7 ih ldz fl, (6.202) 


lim y d0 ire’ w(re"®)| < tim | ld@ rw(re’®)| 
r+} Jo re Jo 
=a fim {7Wrrax], (6.203) 


where Wray is the largest value of |w(z)| on the interval 0 < 9 < 1. This gives us a 
useful result. If W,,,; shrinks to zero faster than 1 /r, then the contribution of the M 
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contour necessarily must vanish. Notice, it is not enough that |w(z)| + 0 as r > ~, 
Because of the extra factor of r, |w(z)| must go to zero faster than 1/r. 


Example 6.10 As an example of this closure technique, consider the real integral, 


T= / a (6.204) 


owt 


The value of the integral is equal to the area under the curve shown in Figure 6.32 and 
is clearly a positive, real quantity. It is easy to extend the integral into the complex 


plane 
l= i: “y (6.205) 
R& 


where the contour R is along the real z-axis, as shown in Figure 6.29. Now consider 
the M integral of Figure 6.30: 


- dz 
In = [ PHT" (6.206) 
Onn, 
z= re® (6.207) 
dz = ire'’d@ (6.208) 
and 
U. . 1 
2+ rre?24 +] 
1 1 
=5 (ae ra 7a) ; (6.209) 


mats VE: cs 


Figure 6.32 Real Integral to Be Evaluated by Closure 
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—le 


Figure 6.33 The Closed Contour for the Integration of 1/(x? + 1) 


This falls off faster than 1/r for all values of 6, so in the limit as r —> ©, J, = 0, and 
the original integral is equivalent to 


“dx dz dz 
3 a rarer 24 
ae kot dl Rati nvtil 


= ¢ de (6.210) 
Cc 


2+1° 


where C is the closed contour shown in Figure 6.33. The two poles of the integrand 
at +7 have also been indicated in this figure. From the residue theorem, the value of 
the original integral must be 


ax : : i te 
: aa QT » Residues of ror inside C. (6.211) 


Only the z = / pole is inside C, and a quick calculation shows the residue there is 
1/2i. Thus we obtain the final result 


= | 1 
/ ni 7?! (5) (6.212) 


This particular integral could also have been closed with a semicircle in the lower 
half of the complex plane, as shown in Figure 6.34. In this figure, we see that now 
the z = ~i pole is enclosed and 


eg 
i. a = —27i(Residue at — i). (6.213) 
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+14 z-plane 
Teal 


Figure 6.34 An Equivalent Closed Contour for the Integration of 1/@? + 1) 


There is a minus sign out front because we enclosed the pole in a clockwise sense. 
The residue of this pole is —1/2i so the value of the integral becomes 


° dx -1 
= -27i{ —) = 21 
[25 2ni (=) 1m, (6.214) 


which, fortunately, is the same as before. 

Not all real integrals can be closed in both the upper and lower half-plane, as 
this one could. Some functions diverge in one half-plane and thus force the closing 
contour to be drawn in the other. Some nasty functions diverge on both sides, and 
make evaluation with this type of contour impossible. 


6.11.2 Closure with Nonzero Contributions 


In some cases we can get away with closing a contour, even if the integral along the 
added segment is not zero. As before, we extend the real integral into the complex 
plane. When we evaluate the integral, however, we must subtract off the added 
contribution, 


/ dx w(x) = [azwo = daw) = ‘h dzw(z), (6.215) 
2 R Cc n 


where C is a closed contour, which includes both the path R along the real axis and 
the closing path. 


Example 6.11 As an example, consider the definite integral 


00 er/2 
I= dx —_—____. 21 
ie = 1 + coshx 6219) 


As usual, the conversion to a complex integral is simple: 


tte sel i ee 6.217 
[hoa | 21+ coshz’ eer) 
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If we try to close the contour with an infinite radius semicircle on either side of the 
real axis, we run into problems, because the complex integrand does not vanish as the 
radius approaches infinity. This can be seen most easily by evaluating the integrand 
along the imaginary axis, i.e., z = iy, where we find 


ee/2 Qeiv/2 


1+coshz] 2+e% +e" 
aly 


(6.218) 


Notice as y — +c, the RHS of Equation 6.218 oscillates. Therefore, closing the 
contour with a semicircular loop as we did in the previous section will not work. 

By examining the poles of this function, you can see another big problem with using 
the infinite semicircular contour. A pole occurs every place where z = +(2n + l)ia 
for all integer values of n. That means there are an infinite number of poles along the 
imaginary axis which would be enclosed by such a contour. 

A better method is to close the contour as shown in Figure 6.35, and then take 
the limit x, — . This contour encloses only a single pole, the one at z = ia. The 
integral around the contour C, in this limit, has four parts: 


e2/2 Xo ee/2 
dz ——_dz= li dz—_—___ 
t 21+ cosh z ae age ic 21+ cosh z 


Xo ti2a ee/2 
a dz-—_—_—__ 
‘ ~ 1+ coshz 


° (6.219) 


Xo +i2T e/2 
ee 
tide 1 + coshz 


0 


—Xo z/2 
e 
+/ dz ————— ?. 
—Xyti2n “1+ cosh z 


For the right vertical segment, z = x, + iy and dz = idy, so 


Xo ti2a z/2 27 » Xo /2piy/2 
e 2ie*’“e'y 
dz ————__ = d - =. 6.220 
[ 4 1+ coshz 7 2 + ere'Y + @ Xe l¥ ( ) 


imag 


+210 


Figure 6.35 Contour for Integration of e/2 /(A + cosh x) 
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AS x, — +, the e* term in the denominator dominates and forces this integral to 
zero. The same thing happens for left vertical segment. The integral along the top, 
horizontal segment (from x, + i2a to —x, + i27) can be expressed in terms of the 
integral along the bottom segment: 


—Xoti2ar ez/2 ~Xo Qerxl2ei2a/2 
dz ——_——_ = dx - ; 
Ve “1+ coshz lie 2 + ere?2™ + ete iam 


6 2e*/2 
7 / ote te any 


Xo ee/2 
= [az 
-x, 1+ coshz 


Consequently, 


ee/2 Xo ee/2 
2 hh a 6.222 
faz 1 + coshz fot 25 az 1 + coshz ( ) 


The right side of Equation 6.222 is just twice the real integral we started with, so 


* ex/2 1 ex/? 
———_—_— = = ——_—_—_—.. 6.223 
[# 1 + coshx 5 hae; + coshz ( ) 


The contour C encloses the singularity at izr, as shown in Figure 6.35. The residue 
of the integrand at izr can be determined by the techniques described earlier in this 
chapter. In this case, however, it may be simpler to look at the Taylor series for cosh z 
expanded about iz. Using Equation 6.109, the first few terms of the expansion are 


1 
coshz = —1 — a -iny - ria ce (6.224) 


This series converges to cosh z in the entire z-plane. From this Taylor series, you can 
infer that the complex integrand in Equation 6.223 has a second order pole at itr, 
because as we move arbitrarily close to this point, the denominator of the integrand 
is dominated by the (z — ir)” term in the series expansion. Equation 6.191 allows us 
to calculate the residue as 


aT 


F (—2¢2/2) = -j. (6.225) 
dz i 
Therefore, according to Equation 6.223, the value of the real integral is 


e/2 1 6 
———_—-_ == = 7. 6.22 
if o 1 + coshx 327) a ( ) 
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Figure 6.36 Closure Results from Contour Deformation 


6.11.3 Another View of the Closure Process 


There is an insightful, alternative viewpoint one can take when looking at closure 
problems. Instead of adding a contour segment to close an integral, the original 
contour, which lies along the real axis, can be deformed as shown in the sequence of 
Figure 6.36. 

Consider a real integral 


r= | dx w(x). (6.227) 


This integral converts to an equivalent complex integral along R, the real z-axis, 


/ dx w(x) = [ame (6.228) 
—oo R 


as shown in the upper left-hand corner of Figure 6.36. Two poles of w(z) are also 
shown in this figure. The contour R can be deformed without changing the value of 
the integral as long as the deformation does not cross a nonanalytic point of w(z). 
Therefore the integrals on R’ and R", shown in the upper right and lower left of 
Figure 6.36, are all equivalent to the original integral 


} dx w(x) = | w(z) dz = i dzw(z) = | dzw(2). (6.229) 
oC R Ri R" 


We can now make the vertical line segments of R” overlap so they exactly cancel, as 
shown in the bottom right of Figure 6.36. The original integral can then be expressed 
as 


l= l dx w(x) = / dzw(z) + 27 S “residues of poles crossed. (6.230) 
=, RW 
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If there are no other poles of w (z) then R” can be expanded to the infinite semicircle 
we used before. If the contribution along R” goes to zero as it is deformed to infinity, 
the result is the one found earlier in Equation 6.197. If, on the other hand, the 
contribution is nonzero, we must take that into account as we did in the previous 
section. 


6.11.4 The Principal Part of an Integral 


When the integrand of a real integral has a singularity for some value of x over the 
range of integration, the value of the integral is undefined. For instance the integrand 
of 


/ a (6.231) 
—o Xx 


behaves this way. However, since 1/x? is an odd function of x, it is tempting to claim 
that the value of this integral is zero, because the negative infinity on the left of x = 0 
is counteracted by the positive infinity on the right of x = 0. We can rescue our 
intuitive feel for this integral by defining the principal part of the integral: 


pp | at = lim aa + as : (6.232) 
ars x3 e—-0 = x € x 


In essence, the principal part takes the original integral and removes an infinitesimal 
region around the singularity. Evaluation of Equation 6.232 is straightforward: 


eel ee is ial ‘ 
-» x3 ese 2 |. 2x? 
= 
2er De? 
= 0. (6.233) 
So, we in fact obtain the intuitive result 
pp | a = 0. (6.234) 
= 505 HE 


The principal part of many real integrals can be evaluated by closure and the 
residue theorem. Consider a real function w(x) that blows up at x = a. The principal 
part of this integral is defined as 


foe} 


PP [ dx w(x) = lim He dx w(x) + i dx wea} : (6.235) 
—o e—- —o ate 
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Figure 6.37 Principal Part Integration in the z-Plane 


This is equivalent to an integral done in the complex z plane, 


pp | dxw(x) = [ dzw(z), (6.236) 
—o% PP 


where PP is a path of integration along the real z-axis from (—%, a ~ €) and (a+ €, &), 
in the limit of € — 0, as shown in Figure 6.37. This integration can be handled with 
closure techniques by adding the standard infinite radius semicircular path in either 
the upper or lower complex plane, plus a semicircular path of radius € around the 
singularity at z = a. In other words, we write 


pac w(Z) = dzw(z) + | dzw(z) + if dzw(z), (6.237) 
C PP U n 


where U is the semicircular path of radius €, M is the semicircular path of infinite 
radius, and C is the closed contour made up of PP + U + M, as shown in Figure 6.38. 

The evaluation of the closed integral over the contour C is simply equal to 277i 
times the sum of the residues of w(z) inside C. Most often, the contribution from M 
is zero. The integral over U is a bit more tricky. If the singularity on the contour is a 
first-order pole, this added contribution is always half the residue of the pole. This is 
best demonstrated with an example. 


imag 


z-plane 


Figure 6.38 Closed Contour for Evaluating the Principal Part 
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Example 6.12 Let’s evaluate 
oc 1 l-e 1 
PP 3 Eli dx -—>——_—~ 
[i (x? + 1)(x - 1) an [. *“@+Dea-1 


is 1 


Applying the above arguments, 


5 1 : 1 
pp fax @+De-) faz @+De-) 
1 
- [ae + De-1 


] 
- fae @iieoy | 


where the poles of the integrand and the various paths of integration are shown in 
Figure 6.39. 

The contribution from M goes to zero because, for large lz, the integrand falls off 
as 1/z°. The closed integral on C is 


1 
¢ dz a Cy a 2mi 5° residues inside C. (6.240) 


There are two enclosed poles, one at z = 1, with a residue of 1/2, and the other at 
z = t, with a residue of (i — 1)/4. Therefore, 


1 arb cad gg Be AVY ie 


imag 


z-plane 


Figure 6.39 Contours for PP Integration of 1 /(x? + i)(x — 1) 
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imag 


-le 


Figure 6.40 Expanded View of the U Integration 


The integration over the small semicircle U needs particular attention. Consider 
the expanded view shown in Figure 6.40 where we have drawn a phasor (z— 1) = ee!?. 
Along the path, dz = ie'dd, and ¢ ranges from 7 to 277. In the limit as e — 0, we 
have 


i = 6.242 
bes z+) 2, ( ) 
so the integral along U becomes 
1 an iee'? 
2s ee>e O d¢é——, 
is “arg ef, 7? eel 
= > (6.243) 


The value of the principal part of the integral therefore becomes 


= 1 1 
pp fas @+he-) faz @+ De) 


1 
4 eee 


Tv 
ely 6.244 
5) ( ) 


Note that the principal part in the example could have been evaluated with contours 
other than those shown in Figure 6.39. The semicircle of radius € could have been 
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taken above the singularity at z = 1, and/or the infinite radius contour could have 
been placed in the lower half of the z-plane. The result obtained for the principal part 
of the integral is the same with any of these choices. 


6.11.5 Definite Integrals with sin 6 and cos 0 
Real integrals of the form 


200 
I -[ dé w(sin @, cos @), (6.245) 
0 


where the integrand is a function of sin 9 and cos @, can sometimes be converted to 
complex integrals along a circular contour of unit radius, as shown in Figure 6.41. 
On this contour, z = e? and @ ranges from 0 to 27. Therefore, on C we can write 


do= = (6.246) 


and 


1. 
sin 8 = ae ~e%)= 5 - 1/2) 

; ; (6.247) 
cos @ = ne" + e7i8) = 5% + 1/z). 


Substituting these quantities into Equation 6.245 allows the integral to be converted 
to one in the complex plane: 


27 
. - + 
1= | d@w(sin@,cos 6) = $ I Sal REA (6.248) 
0 c iz 2i 2 
Example 6.13 As an example of this technique consider the real integral 
297 
dé 
T= >... 6.24 
[ 2+ cos 6 om) 


Applying Equations 6.246 and 6.247, this integration becomes 


2a dg ee 
a I 2+cosd “2g 244+" (6.250) 


where C is the counterclockwise contour that lies along the unit circle shown in 
Figure 6.41. The integrand has two first-order poles at z = —2 + J3. Only the pole 
at —2 + 1/3 is inside the contour, and it has a residue of 1 /2V/3), so the value of 
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Figure 6.41 Contour for Real sin, cos Integrals 


the integral becomes 


2 ie dé@ 
0 2+ cosé 


—2i 
= mi 6.251 
= (=) een) 


ee (6.252) 


V3 


6.12 CONFORMAL MAPPING 


Conformal Mapping is a technique for finding solutions to Laplace’s equation in 
two dimensions. These solutions are of interest in fluid mechanics and electro- and 
magnetostatics. 

The mapping technique uses a complex function, such as w = w(z), to take points 
from the z-plane and “map” them onto points in the w-plane. If w, = w(z,), then w(z) 
is said to map the point z, on to the point w,, as shown in Figure 6.42. In this figure, 
we have continued to use the convention that the real and imaginary parts of z are 
given by x and y, and the real and imaginary parts of w by u and v. If one point in the 


imag imag 
z-plane w-plane 
Yo I = @Z, | 
Voie Sas Poa Se WS 
real real 
= akg ~ Uy 


Figure 6.42 A Mapping of z, tow, by w = w(z) 
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z-plane maps to only one point in the w-plane, the mapping function w(z) is said to be 
single valued. It is possible for the mapping function to take one point in the z-plane 
to more than one point in the w-plane. This type of function is called multivalued. 

Mapping is a two-way street. The function w(z) can be inverted to give a function 
of w, i.e., 2(w), that takes points from the w-plane and maps them back to points in 
the z-plane. If both w(z) and z(w) are single valued, the mapping between the two 
complex planes is called “one to one.” 


6.12.1 Mapping of Grid Lines 


Many times it is easier to discuss mapping properties with the use of a specific 
mapping function. For the purposes of this discussion, consider the mapping function 


w= 2. (6.253) 
w-plane z-plane 
\v 
: y 
Ne Af, 
: i 
Soe . es => -——2—l1—}-~] -2 


Figure 6.43 Mapping of w-Plane Grid Lines onto z-Plane by w = z* 
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Figure 6.44 Perpendicular Intersection of Grid Line Mapping 


How do the grid lines in the w-plane, the lines of constant u and constant v, map onto 
the z-plane? Substituting w = u + iv and z = x + iy into this mapping function gives 


u(x, y) = xe y? 
v(x, y) = 2xy. (6.254) 


Equations 6.254 implies that lines of constant u and v represent two sets of nested 
hyperbola, as shown in Figure 6.43. These hyperbola are plotted on the same graph 
in Figure 6.44. It can be seen that the hyperbola appear to intersect at right angles. 
Indeed, we will show that this is the case and is a general property associated with 
all analytic mapping functions. Notice that the mapping is not one to one, because a 
single grid line of the w-plane maps into two lines in the z-plane. 


6.12.2 Conformal Maps 


A mapping function is said to be conformal if it preserves angles. This means that 
a conformal mapping function will map two lines that intersect at some angle into 
another pair of lines that intersect at the same angle. This is depicted in Figure 6.45, 


u 
w-plane \y z-plane 


v i K 


Figure 6.45 The Angle Preserving Feature of Conformal Mapping 
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w-plane z-plane 


Figure 6.46 Analytic Functions and Conformal Mapping 


where the intersection point z, maps into the intersection point w,. In both planes, the 
angle of intersection is a,. Notice that the lines are not necessarily straight and so the 
intersection angle, in the case of curved lines, must be defined by the local tangents. 
The w = z* mapping function just discussed appears to be conformal because the 
perpendicular grid lines of the w-plane map into hyperbola in the z-plane that intersect 
at right angles. 

All analytic mapping functions are conformal. To demonstrate this, let a mapping 
function take the point z, to the point w,, and consider a nearby point z = z, + Az that 
maps to the point w = w, + Aw, in the limit Az — 0. This is shown in Figure 6.46. 
A line that passes through the two points z, and z maps into a line that passes through 
the points w, and w. If Az = Aze!* and Aw = Awe’®, we can write 


== 2 
dz Azo Az a 


But if the mapping function is analytic, we know the derivative at this point does not 
depend on the way Az approaches zero. Thus the right side of Equation 6.255 must 
be the same for any choice of @ in the z-plane. This means that as a is changed, 8B 
must also change in such a way that 


B-a@=C, (6.256) 


where cy is a constant. All infinitesimal lines passing through z, will be rotated 
through the same angle as they are mapped onto the w-plane. Therefore, all complex 
functions which are analytic are conformal mapping functions. 


6.12.3. Solutions to Laplace’s Equation 


Conformal mapping functions provide a method for solving Laplace’s equation in 
two dimensions. In a two-dimensional Cartesian geometry, Laplace’s equation is 


; er 8 
V‘@®(x, y) = 52 + ae D(x, y) = 0. (6.257) 


If a mapping function w = w(z) with 


zZ=xtiy (6.258) 
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and 
w = u(x, y) + iv(x, y) (6.259) 


is analytic, the functions u(x, y) and v(x, y) must satisfy the Cauchy-Riemann con- 
ditions 


du(x,y) _ dv(x, y) 


2 oy 6.260 
du(x,y) _ _ du(x, y) ware 
oy ox °° 


and these partial derivatives must al] be continuous. Consequently, the second deriva- 
tives obey 


d°u(x,y) _ 0? v(x, y) 


ax? axdy 
(6.261) 
Pu(x,y) _ _ 0? v(x, y) 
dy? dyex — 
This says that 
FA Vea =O (6.262) 
Ox ay? ’ y ’ y * © 
Equations 6.260 can be manipulated to give the same result for u(x, y): 
ee (x,y) = V20Gy) = 0 (6.263) 
oe) ay? vx%,y)= u(x, y . A 


Therefore, if a mapping function is analytic, the functions u(x, y) and v(x, y) each 
satisfy a two-dimensional Laplace equation. Either one could be interpreted, for 
example, as an electrostatic potential or as the pressure in a fluid flow problem. 


Example 6.14 In electrostatics, the electric potential satisfies Laplace’s equation in 
a charge-free region. Conformal mapping functions can therefore be used to generate 
solutions to two-dimensional electrostatic problems. Consider the function 


w=7, (6.264) 


As discussed above, the real part of w(z) is a solution to Laplace’s equation and can 
be identified with an electrostatic potential function ®(x, y). In this case, 


P(x, y) = u(x, y) = x? — y?. (6.265) 
The two-dimensional surfaces of constant potential are given by 


x? — y* = Constant. (6.266) 
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For example, the equipotential ® = 5 surface obeys 


x —y* =5, (6.267) 


and ® = 0 on the surface 
x —y? =0. (6.268) 
This last surface is written more simply as 
x=ty. (6.269) 


Both these surfaces have been shown in Figure 6.47. This means that if plates were 
placed on the x = +y and x” — y* = 5 surfaces and held at @ = 0 and ® = 5 
respectively, the electric potential in the space between these plates would be given 


by 

b =x? y’. (6.270) 
This is shown in Figure 6.48, where we have drawn the equipotential surfaces between 
the plates for ® = 1,2,3 and 4. 


The electric field lines between these plates can also be determined by the mapping 
function. The relationship between the electric potential and the electric field is 


E = -Vo. (6.271) 
Because ® = u(x, y), 
B= — Gg _ dul, y) é,. (6.272) 
ax oy 


Since field lines always run tangent to the field, if y = y(x) is a field line, then 


dy(x) _ Ey _ —0u(x, y)/ay 
dx E, —ou(x, y)/dx" 273) 


Figure 6.47 Boundary Conditions for the Electrostatic Potential Between Two Sheets 
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Figure 6.48 Equipotential Lines Between the Two Sheets 


But this mapping function is analytic, and so its real and imaginary parts satisfy the 
Cauchy-Riemann conditions. Consequently, 


dy(x) _ _ du(x, y)/ox 


=, 6.274 
dx du(x, y)/dy . 
Equation 6.274 can be rearranged to give 
we? sp i AGEL ee) (6.275) 
IX 


The left side of Equation 6.275 is just the total differential of u(x, y), so the field 
lines are simply lines of constant u(x, y). For this particular problem, the field lines 
are given by v(x, y) = 2xy and are shown in Figure 6.49. 

We could also interpret this solution as the laminar flow of a fluid, guided by the 
surfaces u = 0 and u = S. In this case, the lines of constant u would be the flow 
lines, while the lines of constant v would be lines of constant pressure. 


Any conformal mapping function will generate solutions to Laplace’s equation. 
One approach to solving Laplace’s equation in various geometries would be to gen- 
erate a large table of mapping functions, with a corresponding list of boundary 
geometries, and then, much like using an integral table, find the solution to any given 
problem by looking it up. This is cumbersome, at best. A more direct way to arrive at 
the proper mapping function, given the boundary geometry, is presented in the next 
section. 
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Figure 6.49 Electric Field Lines Between the Two Sheets 


6.12.4 Schwartz-Christoffel Mapping Functions 


The previously outlined method for obtaining solutions to Laplace’s equation using 
conformal mapping techniques approached the problem from the wrong direction. 
The mapping function, with solutions to Laplace’s equation given by its real and 
imaginary parts, was specified, and then the boundary conditions associated with 
the solutions were identified. These types of problems are usually posed the other 
way around. The Schwartz-Christoffel method uses conformal mapping techniques 
to design a mapping function from a particular set of boundary conditions. 

To see how this method works, begin by considering mapping function w = w(z) 
and its inversion z = z(w) such that 


dz 
dw 


where w, and k, are pure real numbers, and A is a complex number. We then ask, 
given the functional form imposed by Equation 6.276, how does the v = 0 line map 
onto the z-plane? In other words, as w moves from — to + along the real w-axis, 
as shown in Figure 6.50, what sort of line is traced out in the z-plane? The quantity 


= Aw — w,)*, (6.276) 


imag w-plane 


real 


< 


Figure 6.50 Real w Mapping 
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, imag w-plane 
6=n 
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Figure 6.51 Phase of (w — w,) for w to the Left of w, 


dz/dw, on the LHS of Equation 6.276, is complex and, in polar notation, has both a 
magnitude and a phase. Given the form of Equation 6.276, the phase of dz/dw is 


d 
£4 = LA~ kyL(w— Wo) (6.277) 
dw 


where the operator 2 generates the phase of a complex number. Now consider the 
phase of dz/dw as w moves from — to + along the real w-axis. For a pure real 
w, with w to the left of w,, Figure 6.51 shows that 7 (w — w,) = 7. For pure real w, 
with w to the right of w,, Figure 6.52 shows that 2(w — w,) = 0. Therefore, for w 
on the real axis: 


dz { LA—k wr for w to the left of w, (6.278) 


ZA for w to the right of w, © 


As w moves along the rea} w-axis the mapping function will generate a line in 
the z-plane. As w is incremented by an amount Aw, z will change by an amount 
Az = (dz/dw)Aw. The phases of these Aw and Az steps are related by 


HR Ea Aw. (6.279) 


As w moves to the right along the real axis, 2 Aw = 0 so that 


LAzH=zZ at (6.280) 
= dw 
imag w-plane 
9=0 real 
-- @— oe : 
Wo w 
(w i Wo) 


Figure 6.52. Phase of (w — w,) for w to the Right of w, 
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Figure 6.53 The Mapping of the Real Axis from w, to + onto the z-Plane 


Let the point w, map into z,. Then as w moves away from w, toward +° in steps 
of Aw, z moves away from z, in steps of Az. The result of the mapping of this part 
of the real w-axis is shown in Figure 6.53. The series of Aw’s, ali at an angle of 
zero and all of the same length, are mapped into a series of Az’s. The 4z’s are not 
necessarily the same length but, by Equations 6.278 and 6.280, they all have the 
same phase, 7 A. The mapping of the real w-axis from —© to w, can be done in the 
same way, but now all the Az’s are at an angle equal to 2A minus k,7, as shown 
in Figure 6.54. Consequently, the mapping function that obeys Equation 6.276 maps 
the real w-axis into a line on the z-plane made up of two straight-line segments, as 
shown in Figure 6.55. There is a break point in this line at z,, where the slope of the 
line changes by k,7. 

These ideas can be extended to a mapping function whose derivative has the form 


ot = A(w an w1) !w — w2) @(w os w3)% eae (w = Wr), (6.281) 


Figure 6.55 Mapping of the Real w-Axis onto the z-Plane 
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imag 
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Figure 6.56 The General Form of a Schwartz-Christoffel Mapping 


where w), W2, W3 °**W, are successively increasing real numbers, and ky, k2, k3, 
-+-+k, are arbitrary real numbers. The mapping function that has this form for its 
derivative maps the real w-axis into connected straight-line segments in the z-plane, 
as shown in Figure 6.56. In this way, the derivative of the mapping function can 
be set up so that the real w-axis can be mapped into an arbitrary shape made up of 
straight-line segments. To obtain the mapping function w(z), Equation 6.281 must be 
integrated 


zw) =A ‘| dw {0 — w1)“8(w — w)-* (6.282) 


(w— ws) — Wa) 


and then inverted. 

The real w-axis can map into a set of straight-line segments that may close to form 
a polygon in the finite z-plane, or they may extend to infinity. A variety of boundary 
shapes can be constructed. Some of the more complicated geometries arise from 
allowing w,, the last point along the real w-axis, to be located at +0. In this case, 
the (w — w,)~* factor in Equation 6.281 is eliminated, because it can never change 
the phase of dz/dw. Even though this point does not enter Equation 6.281, it can 
map into a point w,, where there is a k, 7 angle change. In addition, the Schwartz- 
Christoffel method can be applied along any horizontal line in the w-plane. In this 
way the value of the function on the boundary can assume more than one value. 


Example 6.15 In this example, we determine the electric potential and field around 
a sharp, two-dimensional spike. The spike and ground plane are held at zero volts, 
while the potential goes to infinity at an infinite distance above the ground plane. 
This geometry is shown in Figure 6.57. Specifically, P(x,0) = 0 for all x, and also 


®=0 Y 


Figure 6.57 Grounded Spike Geometry 
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a kam = +10/2 
Figure 6.58 Mapping of the Real w-Axis for the Spike Problem 


@(0, y) = 0 for 0 < y < J. We will try to find a solution for ® in the y > 0 half 
plane. 

To use the Schwartz-Christoffel mapping for this problem, the real w-axis must be 
mapped onto this complicated shape. The line in the z-plane can be set up with three 
break points, as shown in Figure 6.58. These three break points, z,, z,, and z,, are the 
mapping of the points w;,w2, and w3, which all lie on the real w-axis, as shown in 
Figure 6.59. At this point, the exact positions of w;, w2 and w3 are arbitrary, except 
for the condition that w; < w. < w3. As w moves along the real w-axis, z moves 
along the path of Figure 6.58, first encountering the break point at z,, then the one 
at z,, and finally the one at z,. At the z, breakpoint, there is a 7/2 counterclockwise 
bend, and so k, = 1/2. At the z, breakpoint, there is a 7 clockwise bend, and so 
ky = —]. Finally, at the Zy breakpoint, there is another a/2 counterclockwise bend, 
and so k3 = 1/2. The equation for dz/dw can therefore be written as 


d 
rs = A(w ~ w1)7!/2Qw — wa)" w — wy, (6.283) 


At this point, a bit of trial and error is necessary to figure out the constants A, w), 
W2, and w3. We must select these constants so the mapping function z = z(w) has 
z(w 1) = 0, z2(w2) = i, and z(w3) = 0. Of course, we cannot check our guess until 


imag w-plane 
real 
—_—__—_@—_—_____@—_+__9- -—____—_ 
WwW, Ww? W3 


Figure 6.59 Mapping of Break Points onto Real w-axis 
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after Equation 6.283 is integrated. From the symmetry in the z-plane, it makes sense 
to make w, = 0 and require w; = —w). As a trial set of constants, let w; = —1, 
w2 = 0, and w3 = | to give 


dz w 
S=A = : 
dw "(wt 1)!2w— 1)? 


(6.284) 


The constant A is just a scaling and rotating factor, which can initially be set equal to 
1 and changed later if necessary. Equation 6.284 can then be rewritten in the form 


dz w 


= = —=>—., 6.285 
ai eal ( ) 
which can be reorganized as the indefinite integral 
/ dz= | dw—=—. (6.286) 
z a | 


Ignoring the arbitrary constant of integration, the solution of this integral is 
z=Vw-t. (6.287) 


Equation 6.287 can be inverted to give 


w= 2th. (6.288) 


Now this mapping must be checked to see if the constants were chosen properly. 
When w = wy = —1,z2 =z, = 0. Whenw = w. = 0,2 =% = f =1 = +i. When 
w = w3 = +1,z = z, = 0. So it looks like all the constants are correct. Notice when 
w = wy = 0, two values of z, are possible. The one that is correct for the geometry 
of this problem is z, = +i. The second value simply corresponds to a solution in the 
lower half of the z-plane. 

A check of Equation 6.288 will show that it is analytic everywhere in the z-plane, 
except at the points z = +/. Consequently, this mapping function will generate 
a solution to Laplace’s equation everywhere, except at those two isolated points. 
To obtain the solution, the functions u(x, y) and v(x, y) must be determined. To 
accomplish this, rewrite Equation 6.288 as 


(ut ivy’ =(x+ iy +1. (6.289) 
Expand this equation, and separate its real and imaginary parts to get 
w-—yw=xr- y? +] (6.290) 
and 


2uv = 2xy. ; (6.291) 
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Figure 6.60 Equipotentials above the Grounded Spike 


Solving Equation 6.291 for u and substituting the result into Equation 6.290 gives 
ry — vt — xv? + yu? -— wv? = 0. (6.292) 


This equation can be used to generate lines of constant v in the z-plane, which we 
interpret as lines of constant potential. The ® = v = 0 line is just the ground plane 
and spike of Figure 6.57. Several other, equally spaced, equipotential lines obtained 
from Equation 6.292 are shown in Figure 6.60. A computer is useful in making these 
plots. 

The electric field lines are determined by setting u(x, y) = constant. The equation 
for these lines is obtained by eliminating v from Equations 6.290 and 6.291: 


ub — xy — x + yw — uw =0. (6.293) 


A plot of these field lines, for uniform steps of u between 0 and 1.4 is shown in 
Figure 6.61. Remember that in field line drawings such as Figure 6.61, the electric 
field is not necessarily constant along a field line. The intensity is proportional to the 
density of the field lines. So in Figure 6.61 it can be seen that the electric field is 
very strong near the tip of the spike, weakest at the bottom of the spike, and becomes 
constant far from the spike. 


Figure 6.61 Electric Field Lines above the Grounded Spike 
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EXERCISES FOR CHAPTER 6 


1. Use Euler’s formula to show that 
cos(n@) + isin(n@) = (cos 6 + isin 6)”. 
This result is called DeMoivre’s formula. 


2. Find all the zeros of these complex functions: 
i. sing. 
ii. COs z. 
iii. sinh z. 
iv. coshz. 
3. Make a sketch in the complex z-plane of the following relationships: 
i. (2 — 5)(z" — 5) = 4. 
ii. (z ~ Si)(z" + Si) = 4. 
iii, 2 + (2)* = 0. 
iv. |z| = 2+ zte 
% lal 


4. Show that natural logarithms exist for real, negative numbers in the complex 
plane. In particular, find the value of In (— 10). 


5. Determine all the values of i! and plot them in the complex plane. As a first step, 
write ? in polar form. 


6. Find the real and imaginary parts of the following complex functions: 
i. w(z) = e. 
ii, w(z) = Inz. 
iii. w(z) = sing. 
iv. w(z) = 1/2. 
v. w(z) = 1/2’. 
vi, w(z) = In(z’ + 1). 
vii. w(z) = el, 
viii, w(z) = 1/cosz. 
ix. w(z) = 2/2". 
x. w(z) = 2" Inz. 
In what region of the complex plane are each of these functions analytic? 
7. Find an analytic complex function that has a real part of u(x, y) = x° — 3xy?. 


8. Using the polar forms for z and w, 
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9. 


10. 


11. 


12. 
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a complex functional relationship between w and z can be written as 
wg) = R(r, 0) 2°. 


The functions R(r, 0) and @(r, 6) are real functions of the real variables r and 6. 
These functions are the polar analogues of u(x, y) and v(x, y). If w(z) is analytic, 
show that the Cauchy Riemann conditions become 


OR | RaO 
or r 00 
and 
10R 0 
r 06 or 


Show by direct integration that 


n_ J 2m n= 

where C is any counterclockwise loop that encloses z,, and n is an integer. 
Develop the Taylor series and identify the region of convergence for the following 
functions: 

i. w(z) = z_ expanded about z = 2. 

ii. w(z) = cosz_ expanded around z = 0. 
iii. w(z) = cosz expanded around z = 77/2. 

iv. w(z) = In(g + 1) expanded around z = 0. 

v. wz) = V1 - z expanded about z = 0. (up to terms of 2) 


Develop the Laurent series and identify the region of convergence for the fol- 


lowing functions: 
i, w(z) = el expanded about z = 0. 
ii, w(z) = 1/(z — 1) expanded about z = 2. 
2 
iii, w(z) = aa expanded about z = 2. 
2 
iv. w(z) = @-m expanded about z = 1. 


Find the Taylor and Laurent series expansions around z = 0 and indicate the 
regions of convergence for the functions: 
1 
(zg — 2)?" 
cos(z) 


ii, w(z) = eG 


i, w(Z = 
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e 

oot 

13. Use the Cauchy integral approach to determine the Taylor series expansion and 
region of convergence of the following functions around an arbitrary point z,: 
(a) w(z) = z. 
(b) w(z) = 2. 
(c) wiz) =z-@. 
d@) wi) = 2 /q- a). 

14, Using the expansion in Equation 6.100, 
(a) Find the Taylor series expansion about z = 1 + i for 


th 
P+ 22° 


iii, w(z) = 


wz) = 


(b) Find the Laurent series expansion about the same point for the same function. 
15. Using the Cauchy Integration approach, and the complex function 


1 
w(z) oe T+ 2’ 


(a) Determine the Taylor series expansion for w(z) about z = 0 and indicate the 
region of convergence. 
(b) Determine the Laurent series expansion for w(z) about z = 1 and indicate 
the region of convergence. 
16. Suppose we need to expand the function 


1 
w(z) = z+1 


with a series that is valid on the closed contour C, a unit circle centered at z = 1, 
as shown below: 


imag z-plane 


real 


(a) First consider the Taylor series expansion for w(z) about z = z,, 


J [e<) 
reg aes 
& n=0 


17. 


18. 


19. 
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Identify the region of the z-plane where z, can be placed so that this series 
will be a valid representation of the function on the contour C and determine 
the c,,. 

(b) Now consider the Laurent series expansion for w(z) about z = z,, 


oo 


1 
Sia ee 


n= —00 


Identify the region of the z-plane where z, can be placed so that this series 
will be a valid representation of the function on the contour C and determine 
the c,. 


Consider the complex function 
1 = 1 
z-3 z-4 


w(z) = 


(a) Find the Taylor series expansion for w(z) about z = | and identify its region 
of convergence. 

(b) Find the Laurent series expansion for w(z) about z = 0, which converges in 
the annulus 3 < |z| < 4. 

(c) Find the Laurent series expansion for w(z) about z = 1, which converges in 
the annulus 2 < |z —- 1] < 3. 

(d) Compare the c_, coefficients in the series solutions to parts (b) and (c) above 
and comment. 

(e) Find the Laurent expansion for w(z) about z = 1, which converges in the 
region 3 < |z — 1. 

(f) How many different Taylor and Laurent series expansions for w(z) can be 
made about the point z = 3.5? 


Consider the function 
e% 


-—]° 


w(z) = 
Zz 


(a) Find the function that generates the coefficients of the Taylor series expansion 
for this function about the general point z = z,. Indicate the region of 
convergence for this series. 

(b) Find the function that generates the coefficients of the Laurent series expan- 
sion for this function about the general point z = z,. Indicate the region of 
convergence for this series. 


Consider the complex function 
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20. 


21. 


22. 


23. 


(a) Find the Laurent series for w(z) expanded about the point z = 1 and identify 
the region of convergence. 

(b) Find the Taylor series for w(z) expanded about the point z = 1 and identify 
the region of convergence. 

(c) Find the Laurent series for w(z) expanded about the point z = 0 and identify 
the region of convergence. 

(d) Why is there no part (d) to this problem? 


Verify that 
sinhz = 


by comparing the Taylor series expansions for sinh z and e£. Comment on the 
convergence. 


Prove that a Laurent expansion of a function about a particular point is unique. 
That is, show if 


oo 


w@ = >> a@-z,)" = > beH zy, 


n=—-N, n=—N, 
then a,, must equal b,, for all n. Use the Cauchy integral formula. 


Consider the function 


(a) Locate all the poles of w(z) in the z-plane and identify their order. 

(b) Find the Taylor series expansion for w(z) about z = 0. Identify the first three 
coefficients of this series and indicate the region of convergence. 

(c) How many different Laurent series for w(z) can be constructed about z = 0? 

(d) Finda series expansion around z = 0, either Taylor or Laurent, that is a valid 
representation of w(z) at z = 7. Indicate the region of convergence for this 
series. 


Consider the function 


w(Z) = PRrao 

(a) Where in the complex z-plane are the poles of w(z)? 

(b) Determine the first three terms for the Taylor series expansion of w(z) about 
z=0. 

(c) Identify the region of convergence for the Taylor series of part (b). 

(d) Determine the general expression for the n" coefficient of the Taylor series 
expansion of part (b). 
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24. 


25. 
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(e) There is a Laurent series expansion for w(z) about z = 0 in the region 
1 < |z| < 2 that has the form 


Obtain the general expression for the c,,. 


Consider the function 


w@) = Boag 


(a) Can this function be expanded in a Taylor series about z = 0? 

(b) Find two different Laurent series expansions for this function, both expanded 
about z = 0. Indicate their regions of convergence. 

(c) Find the Taylor series for this function expanded about z = 1 /2. Indicate 
the region of convergence. 

(d) Find the Laurent series for this function expanded about z = 1/2. Indicate 
the region of convergence. 

(e) What is the residue of this function at z = 0? at z = 12 atz = 1/2? 

(f) What is the value of the integral 


$ dzw(z), 
Cc 


where C is a counterclockwise circle centered at the origin of radius 1/2? 
of radius 1.5? 

(g) What is the value of the integral in part (f) above if C is a clockwise circle 
centered at z = 1 of radius 0.5? of radius 1.5? 


Identify the locations and orders of the poles for the following functions. For 
each of the functions evaluate the residue associated with each of its poles. 
2 


: __2 
i. w(Z) 21 
= sin z 
kv) > 
me 2(z — w/2) 
iii, w(z) = 4 : 
a sinz 
7 _ Inz 
iv. w(z) = BaI 
1 
v. wz) = 


z(e2 — 1) 
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ae 
caer 
26. Each of the following functions has a pole at z = 1. In each case identify the 


27. 


28. 


29. 


30. 


order of the pole and determine its residue. 


i. ————_.. 


gee eS 
a ol 
ii. 
ees 
iii 1 
iii, —. 
Inz 
. 1-—cos(z~ 1) 
lV. _— 
(= 1)? 


v. cot(z — 1). 


Let C be a counterclockwise, circular contour of radius 2 centered at z= 0. 


Evaluate the complex integrals: 


: COS Z 
i. pds 8, 
a 
c 8s 


Evaluate the integral 


Fee aa 
ft =@—DE-3)’ 


(a) where C is the clockwise, circular contour given by |z| = 2. 
(b) where C is the clockwise, circular contour given by |z| = 4. 


Evaluate the complex integral 


sin Zz 


a fia: (@— af 2g + wD)’ 


where C is the counterclockwise, circular contour given by |z + 7/2] = 


Evaluate the complex integral 


I 
dz=— 
f 23-8 


where C is the counterclockwise, circular contour |z ~ 1| = 2. 
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31, Evaluate the complex integral 


] 
eee 


where C is the clockwise, circular contour |z — 2i| = 1. 


sin z 
d ———=-, 
t 42-22 


where C is the counterclockwise, circular contour |z| = 1. 


32. Evaluate the complex integral 


33. Consider the complex integral 


fa sin Z 
c= @— w/e + 1/2) 


and the two contours C = C, and C = C, shown below. 


. imag 


z-plane 


Let J, be the integral around C, and J, the integral around C2. Show by contour 
deformation that J, — I; = —2i. 


34. Consider the function 
ae 
1 — sin (2z) 


(a) Find all the poles of w(z) and identify their order. 
(b) Evaluate the integral 
¢ dzw(2), 
c 


where C is a counterclockwise, circular contour given by Fa =], 


w(z) = 


EXERCISES 211 


(c) Evaluate the integral of part (b) above if C is a counterclockwise, circular 
contour given by |z — | = 1. 


35. Consider the function 


wg) = =. 
1 — cosz 


(a) Find all the poles of w(z) and identify their order. 


$ dzw(2), 
Cc 


where C is a counterclockwise, circular contour given by |z| = 1. 


(b) Evaluate the integral 


(c) Evaluate the integral of part (b) above if C is a counterclockwise, circular 
contour given by |z ~ 277] = 1. 


z 
dz——.. 
pe 


(a) Locate the poles of the integrand and identify their order. 
(b) Evaluate the integral if C is a counterclockwise, circle of radius 1 centered 


36. Consider the integral 


at 

i. z= 0. 
ii. 2 = 277. 
ii, 2 = 27. 


37. Consider the integral 


I= pacer, 
one z 


where n is a positive integer, and C is the counterclockwise circle |z| = 1. 
(a) Find the Laurent series expansion about z = 0 for the integrand. 
(b) Use the residue theorem to evaluate the integral. 


38. Convert each of the following integrals to an equivalent one in the complex 
plane. Identify all the poles of the complex integrands and their residues. Close 
the contours and evaluate the integrals. 


= COs x 
i. dx —, 
i x2 +4 


= “x sinx 
ii. dx ——_. 
0 xt+t 
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iii. i fmm 
0 x 
0 2 
wv. [ Feed call 
0 x +1 
‘CO e* 
je cosh (7x) 
s © sinh (ax) 
‘ —_—— —am<a<. 
vi if ax aD Tia<an 


39. Convert each of the following real integrals into a complex integral and evaluate 
it using the residue theorem: 


5 1 
i. d@ ——_—_—.. 
if 1 + sin? 9 

ii [i : 
do 1 + cos? 6° 


2 
cos @ 
iii. d@ ——___—_.. 
Fe | 5 + 4cos 0 


20 
. cos (30) 
7 ad@ ——_—_. 
i ; 5 — 4cos 0 


40. Show that the integral 


i? 1 
[ eae a 


where x is a pure real variable, can be evaluated by doing an integration in the 
z-plane around the closed contour shown in the figure below. 


. imag 


Identify the similar closed contour in the z-plane that can be used to evaluate 


the integral 
oe 
0 14+x47 


where m is any positive integer. 
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41. 


42. 


43. 


45. 


Consider the integral 


ra 1 
te [a 02 + 1) + 1)’ 


where we must use the principal part because of the behavior of the integrand 
at x = —1. Convert this into an integral in the complex plane and determine an 
appropriate contour of integration to evaluate this integral. Show that you obtain 
the same answer regardless of which side of the z = —1 pole you go around. 


Consider the integral 


00 ew imu 
aS | eae 

where x and u are real variables. 

(a) Without performing the integration, show that J(u) is a pure real function 
of u. 

(b) Convert this integral into the complex plane and evaluate it using closure 
and the residue theorem. 

(c) Plot J(%) vs. u. 

(d) The function /(u) can be converted to a complex function of a complex 
variable by substituting the complex variable w for u, i.e., J(u) — I(w). Is 
I(w) analytic? 


Use complex integral techniques to evaluate the integral 
° sin (kx) 
are 


where k and x are both real variables. 


. Discuss the problems with trying to close the following integral in the complex 


plane: 
co -2 
e 
dx >>: 
ee ee 
Consider the semicircular contour shown below in the limit as € — 0. 
-€ | imag & real 
Cc 
z-plane 


On this contour, evaluate 
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46. Use the results of the previous problem to evaluate 


1 
fia Bacty: 


where C is the closed contour shown below in the limit € — 0 and R, — ©. 


imag 


= 1 
PP d 
_~o xAUx-1) 
where x is a real variable? 
47. Consider the function 
COS Z 
w(Z) = eve 


(a) Evaluate the integral 


$ dzw(z), 
Cc 


where C is the counterclockwise, circular contour given by |z| = 1. 
(b) Evaluate the principal part of the integral 
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48. 


49. 


50. 


51. 


52. 


How does the function w = 2” map the unit circle centered at the origin of the 
z-plane onto the w-plane? 


Using the mapping function 


wz) = 


> 


IN [= 


(a) how does the unit circle |}w — 1| = 1 map onto the z-plane? 
(b) how does the line y = 1/2 map onto the z-plane? 
For each of the mapping functions below, indicate how the lines of constant u 
and v in the w-plane get mapped onto the z-plane: 
i. w(z) = e. 
-1 


ii, w(Q) = = 


iii. w(z) = Inz -— 1. 
What part of the z-plane corresponds to the interior of the unit circle centered at 


the origin of the w-plane for the following mapping functions? 
z—-1 


i. wz) = 


+ 


IN 1A 
| 
~ 


ii, w(z) = : 


IN 


+ 
Consider the mapping function 


Wore 
(a) Is this mapping function analytic? 
(b) Show how this function maps the grid lines of the z-plane onto the w-plane. 
Specifically map, on a labeled drawing, the grid lines shown below. 
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53. 


54, 


55. 
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(c) Is this mapping function conformal? Can it be used to generate solutions to 
Laplace’s equation? 


Consider the mapping function 


Z='1 
w(z) = ee 
(a) Where does the imaginary axis of the w-plane map onto the z-plane? 
(b) Where do the points in the right half of the w-plane map onto the z-plane? 
(c) Where do the points in the left half of the w-plane map onto the z-plane? 


The mapping function 


2 
wt) 
wz) = = 


can be used to analyze two-dimensional flow around a cylindrical obstacle. 
Uniform stream lines described in the w-plane by lines of constant v become, 
upon mapping onto the z-plane, the flow lines around a cylinder. The constant 
pressure lines for the flow around the cylinder are given by mapping the lines of 
constant u onto the z-plane. 


(a) Plot the lines v = —3, —2, -1,0, +1, +2, and +3 onto the z-plane. 
(b) Plot the lines of constant u onto the z-plane. 


(c) The velocity field for the flow is proportional to the gradient of the pressure 
distribution function, i.e., 


v= —pV P(x, y), 


where 1 is a positive constant called the mobility, and P(x, y) is the pressure 
distribution function. What is the velocity field, V(x, y), for the flow around 
the cylinder? 


Use the Schwartz-Christoffel method to determine the functional relationship of 
dz/dw that takes the real w-axis to the step in the z plane shown below. Integrate 
and invert this expression to obtain the mapping function w = w(z). Make an 
attempt to plot the lines of constant v onto the z-plane. If you have access to 
computer plotting software, you should be able to make very accurate maps of 
the constant v lines. 
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56. Use the Schwartz-Christoffel conformal mapping technique to find the stream 
function for flow over a surface that is initially flat and then rises at 45°, as shown 
below. 


w-plane 


(a) Find the mapping function that takes the real w-axis into the line in the 
z-plane made up of the negative real axis and a segment out to infinity at 
6= W/4. 

(b) Find the function that takes stream lines in the w-plane to stream lines in the 
z-plane. NOTE: The last couple of lines of algebra in this calculation get a 
bit messy, but the answer is obtainable. 


57. Recall, the mapping function generated by the Schwartz-Christoffel method is 
given by the expression 


d 
SE = A(w ~ wi) — wa) Bw = wa) 
dw 


(a) How does the real w-axis map onto the z-plane if: A = e'7/4, w, = —1, 
ky = 1/2, w. = 0,kp = —1, w3 = 1, and k3 = 1/2? Draw a sketch of this 
mapping. 

(b) How would your answer to part (a) change if: A = 1, w) = —1l,w2 = 1/2, 


and w3 = 2, while all the k values stayed the same? 


58. Using the Schwartz-Christoffel method, design a mapping function that takes the 
real w-axis to a combination of the negative real z-axis and the positive imaginary 
z-axis with w = 0 mapping into z = 0. 


w-plane z-plane 


Using this mapping function, map the 45° lines shown below in the z-plane onto 
the w-plane. 
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z-plane 


59. Show that if 


then 


w= sin“! z = ~i In fiz + 0 - eae 


7 


FOURIER SERIES 


This chapter consists of four sections. In the first section, we present a quick refresher 
of Fourier series methods and a few examples. The second section demonstrates the 
utility of the complex, exponential form of the Fourier series. These two sections 
should provide a framework for the subsequent chapters on Fourier and Laplace 
transforms. The third section briefly discusses the convergence properties of Fourier 
series and introduces the concept of completeness. In the final section, we discuss the 
discrete Fourier series, which is a numerical technique for obtaining Fourier spectra. 


7.1 THE SINE-COSINE SERIES 


A general Fourier series expansion is the sum of a potentially infinite number of sine 
and cosine terms: 


s(t) = at S> an cos ant + S> by sin ent. (7.1) 


n=1,2,... n=1,2,... 


Each term oscillates at an frequency w, = 2am/T,, an integer multiple of the 
fundamental frequency w) = 27/T,. The value of 7, is determined either by the 
periodicity of the function the series is supposed to represent, or a periodicity we force 
the series expansion to take. The form of Equation 7.1 guarantees that the function 
generated by the series has a periodicity of T,. In other words, it obeys the condition 


S(t) = s(t + T,) for all t. (7.2) 


An example of such a function is shown in Figure 7.1. If the independent variable is 
time, then s(t) has gone on for all past time and must continue forever. 
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Figure 7.1 Periodic Function with Basic Period T, 


In a typical Fourier series problem, we are given a particular function f(t), and 
we want to determine both 7, and the a,, and b, coefficients, so the series is equal to 
F(o), for all t. For this to be possible, f(t) itself must be periodic and exist for all t, 
i.e., 


ft) = f(t+T>) — forallt. (7.3) 


The smallest value of the constant 7, that satisfies Equation 7.3 is defined as the 
period of f(t). 

If f(t) is not periodic, a Fourier series cannot be used to represent f(t) for all r. 
We can, however, make the series equal to f(t) over some finite interval. Consider 
the nonperiodic f(t) shown in Figure 7.2, and the interval t; < t < t,. We can define 
the basic period for the Fourier series to be T, = rt, — t,, and the generated series 
can be made identical to f(t) over this interval, as shown in Figure 7.3. Outside the 
interval, the Fourier series s(t) is periodic and will not match the function f(t). 


Figure 7.2. A Nonperiodic Function 


(2 ai s(t) 


Figure 7.3 Fourier Series Fit to a Nonperiodic Function 
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7.1.1 The Orthogonality Conditions 


To determine the coefficients a, and b,, we must first derive a set of equations called 
the orthogonality conditions. These relations involve integrals of products between 
the sine and cosine functions, i.e., (sin)(cos), (sin)(sin), and (cos)(cos). 

The derivation starts by considering what happens when either the sine or cosine 
functions are integrated over an integer multiple of a period. For any integer n + 0, 
it is obvious that 


to+T, to+T, 

0 o e/ 0 o 
/ dt sin ( =) = if dt cos (Fr) =0. (7.4) 
lo To to T, 


The integrals in Equation 7.4, and the ones that follow, are valid for any t,. Forn = 0 
the first of the above integrals is still zero. However, the n = 0 integral for the cosine 
function takes on the special value 


tot+T, 
‘| dt cos(O) = To. (7.5) 
fo 


The first orthogonality condition involves an integral of the product of sine and 
cosine functions. Because 
sin(a + B) + sin(a — B) 


sina cos B = 5) 


(7.6) 


Equation 7.4 implies 


to tT, 
ee 27m 27m 
dt sin | — —t}= TA 
[ r sin (771) cos (22 r) 0 (7.7) 


for all integer values of n and m including zero. The other orthogonality conditions, 
for integrals involving the products of the sine function with other sine functions and 
the cosine function with other cosine functions, are obtained using 


cos(a — B) — cos(a + B) 


sina sin B = ) 
cos a cos B = costa =) 5 COAG B) (7.8) 


With Equations 7.7 and Equation 7.5, we can write for n > 0 and m > 0 


to +T, 
ae 2 2 T, 

i: dt sin ( | sin ( =) = Sn (7.9) 
oe 2 2 Tt 

i dt cos ( Ft] cos ( =) = Bam > (7.10) 


and 
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7.1.2 Evaluating the Coefficients 


The orthogonality conditions allow us to isolate and evaluate the individual coeffi- 
cients of the Fourier series. If we want a series to equal f(t), we write 


fi) = ae San COS at + S) by sin wt. (7.11) 
n=1,2,... n=1,2,... 


The a, coefficient is obtained from 


fo+T, to+To a 
/ arf) =f dt |—2 + By Qn COS Wat + = b,Sin@,t|. (7.12) 
to to 


n=1,2,... n=1,2,.. 


Using Equation 7.4, we can see that the only term in the expansion that does not 
integrate to zero is the a,/2 term. Thus 


fo+To fo+To a aT. 
d = a = —, . 
[ t f(t) i t 7 5) (7.13) 
or 
2 lo tT, 
ay = ral dt f(t). (7.14) 
To Ji, 


The remaining a, coefficients can be obtained in a similar manner. Operating 
on Equation 7.11 by an integral weighted by cos w,,t, where m > 0, and using the 
orthogonality conditions gives 


ic * "dt cos(@mt) f(t) = > GnSnm na = ane. (7.15) 


e n=1,2,.. 


Thus, the a“ coefficient can be evaluated using the expression 


2 to+T, 
an = -f dt cos(@pt) f(t). (7.16) 
o Jt 


Notice that Equation 7.16 is also valid for m = 0. This was the reason the factor of 
2 was used in the a, term of Equation 7.11. With that convention, the a, term does 
not need to be treated differently from the other a,, terms. The b, coefficients are 
obtained in exactly the same manner, except the integrals are performed with sin @nt. 
The orthogonality equations give 


00 


totTo T, iB 
| dt sin(@mt) f(t) = ah ma = Oma (7.17) 


with the result that 


2 to+T 
bn = = i dt sinant FO: (7.18) 
oO to 
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7.1.3 The Fourier Series Circuit 


There is an interesting way to view the summation of terms in a Fourier series 
which we will borrow from the field of electronics. We can model each term in 
the summation by a signal generator, which puts out a voltage signal at the proper 
frequency and amplitude for that term. This is shown in Figure 7.4. The a, term is 


@ bs sin @st oS 

bz sin @zt 
@ CF) 
©) b, sin Q@t A) 


T a,/2 (4) s(t) 


€) a1 COS @it o 
@2 COS Ght 

© carro © 
a3 COS Ost 

o-— & 


/ 
/ 


Figure 7.4 Fourier Sine-Cosine Series Circuit 


224 FOURIER SERIES 


a d.c. source of a,/2 volts. The b, term is a sine wave oscillating at frequency @, 
with a signal amplitude of b, volts. The summation is accomplished by joining all 
the signals into one output from the circuit. This is the basis for what the electrical 
engineers call the frequency spectrum of a signal. Conceptually, they often break 
a signal into its various Fourier series components, analyze how each component 
behaves individually, and then join the results back together to generate their final 
solution. 

We will return to this analogy in more detail when we discuss Fourier and Laplace 
transforms. 


Example 7.1 Asa very simple first example, consider the “expansion” of the func- 
tion, f(t) = sint. The function is obviously periodic, with T, = 2a. The Fourier 
series takes the form 


Jie ro) ) 
sint= > + dm cos(nt) + Sohn sin(nt). (7.19) 


n=l 


We could struggle through the integral equations of the previous section to pick out 
the values of the a, and b, coefficients. In this particular case, however, it is easy to 
see that the only nonzero coefficient is b;, which has a value of 1. 

The set of coefficients in the Fourier series is sometimes referred to as the spectrum 
of f(t). The spectrum for f(t) = sint is shown in Figure 7.5, where we have plotted 
the b,, coefficients versus n. In this case, there is only one nonzero point on the graph. 


! ' 
L.--—.. ~_.. e -° o- e ee 


1 2 3 4 5 6 
Figure 7.5 The Spectrum for f(t) = sint 


Example 7.2 As a second example, let f(t) be the periodic triangular waveform 
shown in Figure 7.6. Notice that this function has the same amplitude and period 
as the previously discussed sine wave. On the interval -7/2 < t < 32/2, this 
triangular wave can be expressed as 


2t/a -n/2<=ts 7/2 


fr = ie ~2/4 n/2<1<30/2 em) 


As before, the basic period of this signal is T, = 277, so w, = n. Because this is an 
odd function of t, we can immediately see that all the a, coefficients are zero. The b, 
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, fr(t) 


Figure 7.6 The Triangle Wave 


coefficients can be calculated from 


1 31/2 
b, = =| dt fr(t) sin(nt) 


WJ —n/2 


Dh pecs 32h A i 2r 
=— dt — sin(nt) + — 1 dt (2 — =) sin(nt), (7.21) 
TJ-n/2 7 TW Sa/2 W 
with the result that 
8 | at 
n= pe a (7.22) 


A plot of the spectrum for the triangle wave is shown in Figure 7.7. 


Dp 
1 


Figure 7.7. The Spectrum for the Triangle Wave 


Example 7.3 As a final example, let’s calculate the Fourier series for the square 
wave function fs(t), shown in Figure 7.8. If this function has unit amplitude and a 
period of 7, = 277, it can be expressed as 


+1 Ost<r 


fat) = { a es ge as (7.23) 


Again, this is an odd function of ¢ and so all the a, coefficients are zero. The b, 
coefficients can be evaluated from Equation 7.18: 
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Figure 7.8 The Square Wave 


27 
b, = i | dt fr(t) sin(nt) 
Tw JO 


7 2a 
aed ) dt sin(nt) — x i) dt sin(nt), (7.24) 
Ww JO 


WJSa 


with the result 


(7.25) 


The spectrum for the square wave is shown in Figure 7.9. 

By comparing Figures 7.5, 7.7, and 7.9, you will notice that as the waveform 
changes from a sine wave, to a triangular wave, and finally to a square wave, the 
relative amplitudes of the higher frequency components increase. 

The convergence of the Fourier series to the square wave function is demonstrated 
in Figure 7.10. Three plots are shown. In the first, just the n = 1 term of the series is 
shown. The middle plot shows the result of the sum of the n = 1 and n = 3 terms. 
The last plot on the right is the sum of the n = 1, n = 3, and n = 5 terms. Notice 
that, while the square wave itself is not well defined at its discontinuities, its Fourier 
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Figure 7.9 The Spectrum for the Square Wave 
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Figure 7.10 The Convergence of the Fourier Series to a Square Wave 


series converges to a value equal to the average of the step at these points. There is 
another interesting convergence effect going on at this discontinuity called the Gibbs 
phenomenon, which will be discussed later in this chapter. 


7.2 THE EXPONENTIAL FORM OF FOURIER SERIES 


It turns out that representing the Fourier series with sines and cosines is not very con- 
venient. This is because the sine and cosine terms need to be manipulated separately. 
There is an alternative representation of the Fourier series, the complex exponential 
form, which combines the two sets of terms into one series. We can convert the 
Fourier series of Equation 7.1 into exponential form using the identities 


cos 8 = ; (ef? + e7'*) (7.26) 
‘ Li -i 
sind = = (e gees eae ir (7.27) 


7.2.1 The Exponential Series 


The exponential form of the Fourier series follows directly from the sine-cosine form 


7 ao fo) [-) 
Mei a cos(w,t) + we sin(w,). (7.28) 


n=1t 
Using Equations 7.26 and 7.27, we can write 


Dm 


; a ; aA 
am COS(@ yf) a bin sin(@mnt) = ae (ein! +e =) +4 cf 
L 


(enh ee). (7,29) 


The RHS of this expression can be rearranged to give 


m bn i m bmn ~j 
Aim COS( Wt) + Dm SiN( Gt) = (= + =) eidmt 4 (= a =) ei (7,30) 
I Ll 
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Equation 7.28 can then be placed in the form 


s(t) = YS c, ee", (7.31) 
n=—o 
where 
ao/2 n=0 
Cn = & (A, — ibn) /2 n>0o0. (7.32) 


(a_, + ib_,)/2 n<O 


Notice that if all the a, and b, coefficients are real, Equations 7.32 imply c, = c*,. 
In that case, the exponential form can also be written 
s(t) = cp + 2 Real cel (7.33) 


n=] 


7.2.2 Orthogonality Conditions for Exponential Terms 


The c,, coefficients can always be obtained by determining the a, and b, coefficients 
using the techniques already described. Better yet, they can be determined directly 
using a new orthogonality condition: 


lot+To 
i; dt ete tm! = § To. (7.34) 
bo 


This condition follows directly from the definition of the exponential function and 

the previous sine and cosine orthogonality conditions. It is true for any f,, n, and m. 

The period 7, of f(t) is still related to the fundamental frequency by T, = 27r/ wp. 
If a function f(t) is to be expanded in an exponential Fourier series 


fo) 


fO= SO ce, (7.35) 


n=—o 


the c,, coefficients can be determined by using the orthogonality condition in Equa- 
tion 7.34. To do this, multiply both sides of Equation 7.35 by e~““=' and integrate 
over the period, 7,. This gives 


to+To a an 
dt f(je "= | dt Cpe ee Tome (7.36) 
i, pay 


n=—0 


Exchanging the summation and integration on the RHS of this expression and using 
the orthogonality condition gives: 


tot+T, ; ba 
dt f(pe en = Sn Te 
fooas Y. SuatTo 


‘0 n=—o 


=c,,To- (7.37) 


=m 


THE EXPONENTIAL FORM OF FOURIER SERIES 229 


Therefore, we can evaluate the c,, coefficient using the expression 


1 fot To : 
GQ = val dt fe", (7.38) 
To Ji, 


which works for all the c,,’s, including c,. 


7.2.3 Properties of the Exponential Series 


There are three important properties of the exponential form of the Fourier series that 
can simplify calculations. The first deals with the c, term of the expansion. Because 


1 sr 
G== dt f(t), (7.39) 
Pde 


C, is just the average value of f(t) over the period. This is often written as 


Co = (f(t). (7.40) 


If f(4) is a real function, c, must be a pure real number. 
The even or odd nature of f(t) places conditions on the c,. If f(t) is a real, even 
function, then f(a) = f(—a), and 


i f® ; 
— dt f(ie *°" 
a= F [ Fide 


ae a 
=— if dt f(-—ne'". (741) 
To Jo 
If a change of variables is introduced, with a = —t and da = —dt, this equation 


becomes 


tO 
3 
| 


1 fp : 
= -7 if da f(a)e'* 


0 
ae | da f(a)ei"*. (7.42) 
T, J-r, 
The integrand f (a)e’"* is periodic in a with a period of T,. Therefore, 
| ae 
C,= => i da f(aje’™. (7.43) 
Ts Jo 
Now take the complex conjugate of both sides of this equation to get 
To 
C= = | da f*(a@e'”. (7.44) 
T, 0 


If f(t) is areal function of t, then f*(t) = f(t), and we obtain the result 
C, = C. (7.45) 


Therefore, if f(t) is areal, even function of t, then the coefficients of its exponential 
Fourier series are all pure real. The Fourier series expressed with sines and cosines 
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actually has an analogous symmetry. A real, even function expressed in that form 
will not contain any sine terms. Similar arguments show that if f(s) is a real, odd 
function, then c>¥ = ~—c,, and the coefficients are pure imaginary. The analogous 
statement for the Fourier series expressed in terms of sines and cosines is that any 
real, odd function will not contain any cosine terms. If (1) possesses neither odd nor 
even symmetry, then the coefficients of the exponential series are complex. In this 
case the sine/cosine series is made up of both sine and cosine functions. 


7.2.4 Convenience of the Exponential Form 


A major advantage of the exponential form of the Fourier series becomes evident 
when it is used with differential equations. If sines and cosines are used instead of the 
exponential notation, you must keep track of two sets of series, which can become 
horribly intertwined when derivatives are taken. With the exponential notation, all 
the terms are treated on equal footing, so the bookkeeping is much easier. 

Consider a nonhomogeneous, linear differential equation with constant coeffi- 
cients: 

(4% ry ae c) f(t) = d(t) (7.46) 
dt? dt i , 

where d(t) is a known function, and the problem is to find f(t). If d(t) is periodic, it 
can be represented by a Fourier series 


CO 


di) = 5° Dye", (7.47) 
with 
1 
D,=— | dtd(te i. (7.48) 
T.,.J6 


The coefficients D,, and the frequencies w, are both known quantities. The unknown 
function f(t) can be set equal to its Fourier series 


fO= YO Fee. (7.49) 


n=—0 


Here, the F,, are unknown, but the w, are the same frequencies used in Equation 7.48, 
which were determined by the periodicity of d(r). This will always be the case for 
linear differential equations. For nonlinear equations, f(*) can contain frequencies 
which are not in d(f). 

Substituting Equations 7.47 and 7.48 into the differential equation converts it into 
an algebraic equation, because all the dependencies on the independent variable t 
cancel out: 


(—@2A + iw,B + C) F,, = Dy. (7.50) 
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Equation 7.50 can be solved for the unknown coefficients F,, and used to express the 
solution in terms of a Fourier series: 


-D 
Aw? ~ io,B— C 2) 
© —D 
t= =n lant : 
JO= De (saree) Gen 


n=—o© 


If sines and cosines had been used to expand the Fourier series for d(t) and f(t), 
the time dependence could not have been removed as easily. It might be a worthwhile 
exercise to try this for yourself. The Fourier series approach outlined in Equations 
7.49-7.52 will be described in more detail in Chapter 10, when we discuss solution 
methods for differential equations. 


7.3. CONVERGENCE OF FOURIER SERIES 


Up to this point we have assumed that a periodic f(t) could always be expressed with 
an exponential series 


f= YO cei, (7.53) 
n=—o 
with ¢,, coefficients given by 
ee a , 
Cc, = =f at fine '". (7.54) 
21 0 


Actually, the equivalence of f(t) and the series on the RHS of Equation 7.53 was just 
an assertion that we must justify by demonstrating that the Fourier series converges 
to the function f(t). In fact, this will not be possible for all types of functions. 

This section presents the conditions necessary for the convergence of Fourier series 
and introduces some new terminology. The proofs of the statements in this section 
are too involved for this short chapter. We present only a few important results here 
without proving them. A complete and rigorous reference for this material is the book 
Fourier Series by Georgi Tolstov. 


7.3.1 Types of Convergence 


In general, to show convergence, one starts with a function, f(t), and calculates 
the Fourier coefficients c, using Equation 7.54. This step actually places a first, 
obvious condition on what types of functions can be represented by a Fourier series. 
It must be possible to perform the integration of Equation 7.54. Next, a version of the 
Fourier series with a finite number of terms is constructed using these coefficients. 
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Convergence is proved by showing as the number of terms increases, the series 
gets arbitrarily close, by some measure, to f(t). There are several different types of 
convergence. We will discuss three in this section: Uniform, Pointwise, and Mean- 
Squared convergence. 

Pointwise convergence is simple to understand. Define the partial Fourier series 
Sy(t) as 


n=+N 
Sv) = >> cee. (7.55) 


n=—-N 


This sequence of functions is said to converge pointwise to the function f(t) in the 
rangea =t = bif 


jim (Sv) — FO] = 0 (7.56) 


for any given point ¢ in that range. 
The same series is said to converge to f(f) in a mean-squared sense if 


b 
Jim. i dt |Sy(t) — f(D)? = 0. (7.57) 


This is a less strict condition. Notice the sequence Sy(t) = t% converges to the 


function f(t) = O over the range 0 = ¢ = 1 in the mean-squared sense, but not in a 
pointwise sense, because of the behavior of the function at ¢ = 1. 

Uniform convergence is the most strict convergence of the three. The sequence 
Sy(t) converges uniformly to the function f(t) on the range a = ¢t = bif 


jim ISu() — FOlmax = 0, (7.58) 


where max gives the largest value on the range. 


7.3.2 Uniform Convergence for Continuous Functions 


We state the following theorem without proof. If a periodic function is continuous 
and piecewise smooth, its Fourier series will always converge uniformly everywhere. 
A function is piecewise smooth if its derivative exists everywhere, except at isolated 
points. The triangle wave shown in Figure 7.6 is continuous and piecewise smooth, 
but the square wave of Figure 7.8 is not. 


7.3.3. Mean-Squared Convergence for Discontinuous Functions 


If a function is discontinuous, the Fourier series does not converge uniformly, but it 
may still converge in a mean-squared sense. A good example of a function with a 
discontinuity is the square wave function we discussed in an earlier example: 
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(7.59) 


The sum of the first three terms of the Fourier series was shown progressively in 
Figure 7.10. Figure 7.11 shows the transition region for the summation of N = 40, 
60, and 200 terms of the partial Fourier series. The are two complications that arise 
with the convergence of the Fourier series at points of discontinuity. The first problem 
is fairly obvious, while the second problem is quite subtle. 

The first problem is evident when you consider the point t = 0. According to 
the definition of f(t) in Equation 7.59, f(0) = 1. Clearly, looking at Figures 7.10 
and 7.11 shows the Fourier series does not converge to this value, even as N —> ™, 
but rather to the average value of the step. In general, if a periodic function has a 
discontinuity where the function jumps from f; to fo, its Fourier series will converge 
to a value of 


fit fe 


5 (7.60) 


at the discontinuity. 

The second complication of the convergence at the discontinuity is less obvious. 
Again consider the summation of the first three terms of the Fourier series for the 
square wave, shown in Figure 7.10. As N is increased from 1 to 2 and then to 3, the 
partial sum Sy(t) looks more and more like a square wave, but there consistently is 
an oscillation before and after the rising step. You might expect this oscillation to 
shrink in size as the number of terms in the sum increases, but this does not happen. 
In Figure 7.11, notice the overshoot recovers faster as N increases, but its amplitude 
remains constant at a value around 1.17. 

This overshoot of the Fourier series at a discontinuity is referred to as the Gibbs 
phenomenon. In the region surrounding the discontinuity, the series is converging 
point by point, but not uniformly. Given any point t = f,, except exactly the point of 
discontinuity, we can increase N until Sy(to) — f(t.) is arbitrarily close to zero. We 
cannot, however, decrease |Sy(t) — f(¢)|max below the value of 1.17 by increasing N. 

This leads us to a more general form of the convergence theorem of the previous 
section. If a function f(t) is piecewise smooth and piecewise continuous, the Fourier 
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Figure 7.11 The Gibbs Phenomenon 
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series converges pointwise to the value of f(f) at all points of continuity, and to the 
mean value of the step of any points of discontinuity. As a whole, the entire series 
converges to f(t) uniformly if the function is continuous, but only in a mean-squared 
sense if it has discontinuities. 


7.3.4 Completeness 


Mathematicians often like to phrase these convergence theorems using the idea of 
completeness. A set of functions is said to form a complete set over a functional class 
if a linear combination of them can express any of the functions in that class. The 
complete functions are often called basis functions, because they span the space of 
possible functions, much like the geometric basis vectors span all the possible vectors 
in space. 

Do the exponential terms of the Fourier series form a complete set? In the discus- 
sion above, we have said that they form a complete set over the class of piecewise 
smooth, continuous functions if we require uniform convergence. The theorem in the 
previous section shows they also form a complete set over the class of piecewise con- 
tinuous, piecewise smooth functions if we only require mean-squared convergence. 
Notice we must always declare what kind of convergence is being used before we 
can say whether a collection of basis functions is complete. 


7.4 THE DISCRETE FOURIER SERIES 


The coefficients of the Fourier series, either the a, and b, of Equation 7.1 or the 
c, of Equation 7.31, describe the frequency spectrum of a signal. Often, in physical 
experiments, there is an electrical signal which we would like to analyze in the 
frequency domain. Before the advent of the modern, digital era, electrical spectra 
were obtained using expensive analog spectrum analyzers. In simplified terms, these 
used an array of narrow-band, analog filters which split incoming signals into different 
frequency “bins” which were then reported on the output of the device. Modern 
frequency analyzers obtain their spectra by digitally recording the incoming signal at 
discrete time intervals, and then using a computer algorithm called the Fast Fourier 
Transform (FFT). 

When a computer is used to perform the Fourier series analysis of a signal, it cannot 
exactly carry out the operations of the previous sections. Because a computer cannot 
deal with either continuous functions or infinite series, integrals must be replaced 
by discrete sums, and all summations must be taken over a finite number of terms. 
The following sections describe a simplified version of the FFT, which adjusts our 
previous methods in an appropriate manner for computer calculations. 


7.4.1 Development of the Discrete Series Equations 


The first step in the development of a discrete Fourier series for f(t) is to determine 
T,, the period of the signal. Next the computer takes 2N samples of f(t), spread 
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Figure 7.12 The Sampling of f(t) 2N = 8 Times 


evenly over the period, to generate the values 
Tok 
Kk = f (tz) k= ON fork = 0,1,2,...2N — 1, (7.61) 


as shown in Figure 7.12. The first sample fo is taken at tg = 0, and the last sample 
fon-1 is taken at bn-1 = T,(2N ra 1)/(2N). 
The standard exponential Fourier series is summarized by the pair of equations: 


fO= So cei (7.62) 
1 [% 
Cora f dt fie", (7.63) 


where w, = 21n/T,. These equations need to be modified for the computer. The 
new versions of Equations 7.62 should generate the sampled values f;, from a new 
set of c,, via some kind of finite sum: 


f= YS cel > fr = YS cneier*. (7.64) 
n=0 


n=—«% 


In Equation 7.64, w, is defined, as it was before, as w, = (27m)/T7,, and nis an integer 
which we will arbitrarily start at zero and let run to a currently undefined, upper limit 
Nmax. The continuous variable t has been replaced by #;, the set of discretely sampled 
times. 

To evaluate the c,, of Equation 7.64, an orthogonality condition similar to Equa- 
tion 7.34 is needed. We can try to construct one by converting the integral over t into 
a sum over all the values: 


To 2N-1 : 
[ dt ee tm! = Tb, s etn gmk = 2. (7.65) 
° k=0 
Notice that the summation over the discrete time index k, unlike the frequency index 
n, has a well-defined upper limit determined by the number of sample points. The 
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question now is, does the sum over k in Equation 7.65 lead to an orthogonality 
condition that allows a determination of the c,,’s in Equation 7.64? Answering this 
question will take some work. 

Equation 7.65 can be rewritten as 


2N-1 2N-1 
> el@nte o—iOmte — yy ef nok (7.66) 
k=0 k=0 


where we have substituted w, = 27n/T>. Defining 
r= ein) (7.67) 


the sum in the RHS of Equation 7.66 can be recognized as a geometric series, which 
can be expressed in closed form as 


2N-1 
_ 2N 
pk x 7 (7.68) 
l-r 
k=0 
The summation in Equation 7.66 can therefore be written as 
2N—1 ae ee 1 od ei2m(n—m) 
i@y, —iOmtk — 
D eitnte ar oe (7.69) 


Because n and m are both integers, the numerator of Equation 7.69 is always zero. 
The LHS of this equation is therefore zero unless the denominator is also zero. This 
occurs when 


n—m=0,+2N, +4N,.... (7.70) 


From our original definition, the integers n and m range from zero to the unspecified 
limit max of the sum in Equation 7.64: 


O=n= Mx 
OSm= nnax- (7.71) 
If we limit the sum in Equation 7.64 to 2N terms, that is yg, = 2N — 1, we can force 
jn —m| S2N - 1. (7.72) 


This important result makes the denominator of Equation 7.69 zero only whenn = m. 
This is exactly what we need to construct the orthogonality condition. All that remains 
is to evaluate Equation 7.69 for n = m. We can easily accomplish this by returning 
to Equation 7.66 and substituting (n — m) = 0. Our final orthogonality condition 
becomes 


ei@ntee—itmk = § ON, (7.73) 


a 
Wt 
i] 


THE DISCRETE FOURIER SERIES 237 


This orthogonality condition can now be used to invert Equation 7.64 and eval- 
uate the c, for the discrete Fourier series. To do this, we multiply both sides of 
Equation 7.64 by e~'“»« and sum over k: 


2N-1 2N~1L 2N-1 
y eo iemtk = S- oe iOmte c,etenk 
k=0 k=0 n=0 
2N-1 2N-1 
a Cs elOntk 9 lemty 
n=0 k=0 


Using the orthogonality condition gives 


ONet 2N-1 
Se iontt f= S> caSam2N, (7.74) 
k=0 n=0 
with the result that 
1 N=! 
spl ~ toot 7.75 
oe Pag 2, fee (7.75) 


Thus the original equation pair which defines the Fourier series can, for the discrete 
Fourier series, be replaced by: 


ce 2N-1 
£00 = Yo cel! — fo = > cyeiett, (7.76) 
AES Se n=0 
1 T, ; 1 2N-1 
Cy = a dt fie" C= aN > freee", (7.77) 


7.4.2 Matrix Formulation 


Equations 7.76 and 7.77 are easily converted to a matrix format, which in turn, is 
easily handled by a computer. If we define the elements of the matrix [M] as 


May = ei@ntk = einnk (7.78) 
Equation 7.76 can be written using subscript notation as 
Se = €nMnk- (7.79) 


Likewise, Equation 7.77 becomes 


oe 
fa = Mra: (7.80) 
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The orthogonality condition, Equation 7.73, is simply 
MuMey = Snm2N. (7.81) 


Keep in mind, the subscripts of all these expressions run from 0 to 2N — 1. 

Notice that the [M] matrix is the same for all functions, it only depends upon the 
number of sample points. The computer obtains the coefficients of the discrete Fourier 
series by inserting the sampled values of f(t) into Equation 7.80 and performing the 
summations. 


Example 7.4 Consider a very simple signal, 
f(@® = cost. (7.82) 


The standard exponential Fourier series for this signal gives a spectrum with only 
two nonzero components, c; = c_; = 1 /2. These components occur at w = +1, 
the natural frequency of the signal. A discrete Fourier series can be constructed for 
this signal by sampling it four times over its 27r period, as shown in Figure 7.13. 
Table 7.1 shows the sampled values and sampled times. 

Any signal sampled four times in a period has 2N = 4 and an [MJ] matrix given 
by 


May = eb 2", (7.83) 


Figure 7.13 The cost Function Sampled Four Times 


TABLE 7.1. Sample Times and Values of cost 


k ty Si 
0 10) 1 
1 a/2 
2 7 —) 
3 3/2 
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which in matrix array notation is 
1 1 1 1 
_ fil i -l -i 
{M]} = 1-1 1-1 (7.84) 
1 -i -1 i 
Using Equation 7.80, we can evaluate the c,, 
Cg 1 1 1 1 1 
CD ad ai Sl i 0 
Cy 4/1 -1 1 -1 —1]? ED) 
C3 1 i -l -i 0 
with the result that 
Co 0 
| _|3 (7.86) 
1) 0 
&3 5 


A spectral plot of these coefficients versus w, = n is shown in Figure 7.14. 


Parts of this spectrum make sense, and other parts do not. The fact that cy and c, 
are zero, while c, is nonzero, agrees with the regular Fourier series spectrum. But 
the component at w; = 3 makes no sense, because there obviously is no w = 3 
component in cos ft. This suspicious extra component is referred to as an alias, an 


effect we will discuss in detail in the next section. 


1/2 - e e 


n=Q, 
o— -_—-® secs 
0 1 2 3 


Figure 7.14 Discrete Fourier Series Spectrum for cost Sampled Four Times 
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Now let’s see how the coefficients give us back the sampled values of the function. 
Equation 7.79 written for this example becomes 


3 3 
= Sige = Soaet*, (7.87) 
Inserting the values for c, and noting that t, = kT,/(2N) = ka/2, 


fe = fl = &) = xe + xe. (7.88) 


The RHS of Equation 7.88 sums to the proper, real value of cos ¢ at the sample times 
t = 0, w/2,7, and 37/2. At other values of ¢ # %, the RHS of Equation 7.88 
does not equal f(s) and is not even pure real! In order for Equation 7.76 to be a good 
representation of the original f(t), it seems we must sample the signal many times. 
We will quantify this statement in the following sections and in the next chapter. 


7.4.3 Aliasing 


In the previous section, the discrete Fourier series technique was applied to the 
function f(t) = cos¢ and it was sampled four times. The resulting spectrum, shown 
in Figure 7.14, had a legitimate component at w = 1 and an erroneous one at w = 3. 
We called the # = 3 component an alias. What is the source of this alias, and what 
parts of a discrete spectrum are to be trusted? 

The alias arises from the simple fact that when four sampling points are used, the 
two functions cos ¢ and cos 3¢ give exactly the same values at the sampling points 
(0, 1/2, 1, 37/2, 277). We can remove the alias at w = 3 by sampling more times per 
period. If cos ¢ is sampled eight times, instead of four, the new spectrum appears as 
shown in Figure 7.15. The component at w = 1 remains unchanged, the component 
at w = 3 is now zero, but there is an alias at w = 7. And, as you probably guessed, 
the alias occurs because cos 7t and cost have the same values at the eight sampling 
points (0, 7/4, 7/2, 32/4, 1, 51/4, 32/2, 77/4, 277). 

The sampling theorem, which we will prove in the next chapter, says in order to 
accurately portray a signal by sampling, you must sample at least twice the highest 
frequency present in the signal. If this is done with a discrete Fourier series, the 
resulting spectrum can be trusted(i.e., there will be no aliasing) up to half the sampling 


/2 + ° BD Se a Ste As SS -@ 


Figure 7.15 The Spectrum of cost Sampled Eight Times 
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frequency. Therefore, the spectrum of Figure 7.14 is valid up ton = w, = 2, while 
the spectrum of Figure 7.15 is valid up ton = @, = 4. The signal 


Ff) = cost + cos 3r (7.89) 


has a period 7, = 27 and must be sampled at a frequency of 6, or six times over that 
interval, in order to trust the sampled spectrum up to n = @, = 3. 


7.4.4 Positive and Negative Frequencies 


The normal exponential Fourier series contains terms for both positive and negative 
frequencies. The discrete Fourier series, the way it was developed in the discussion 
above, created a spectrum that ranged in » from 0 to (2N — 1) or 0 = a, = 
2a(2N — 1)/T,. To establish this orthogonality condition of Equation 7.73, however, 
it is only necessary that 7 take on 2N consecutive values. The discrete series analysis 
can therefore be set up with all the sums over n starting at some arbitrary integer, say 
No, and ranging to (2N + nm, — 1) where 2N is still the number of sampled points in 
the period. Equation 7.76, for example, could have been written as 


2N+ny-| 


j= Ye eer, (7.90) 


n=N 


The elements of the [M] matrix are generated by the same set of k values, but a shifted 
set of 7 values: 


eitnk/N Mg ZnSe (2N tino = 1) 


Mrk = 0<k<QN-1) 


(7.91) 
Except for these modifications, the development of the discrete Fourier series would 
be unchanged. 

This flexibility in selecting the value of n,, coupled with the sampling theorem, 
allow discrete Fourier series spectra to be set up more like the regular exponential 
Fourier spectra. If we set n, to —N, then the sums over n range from —N to (N — 1) 
and the spectra that result will range in frequency from —27N/T, to 277(N ~ 1)/T>. 
If, in addition, the signal is sampled at a rate of at least twice the highest frequency 
present, there will be no aliasing in the spectrum that results from the discrete Fourier 
series analysis. Taking such a range for n, the two spectra shown in Figures 7.14 and 
7.15 would become as shown in Figure 7.16. In this way a discrete Fourier series 
spectrum can be generated that more truly represents the real Fourier spectrum of 
the signal. The crucial step here is to know the highest frequency present. This is 
sometimes accomplished by passing the signal through a filter that limits the highest 
frequency to a known value and then sampling at twice this rate or faster. 

This way of looking at the discrete Fourier spectrum allows another interpretation 
of aliasing. The spectral component at n = 3 in the spectrum shown in Figure 7.14 
can be looked at as an “alias” of the true component at n = — 1 of the upper spectrum 
in Figure 7.16. The spectral component at n = 7 of Figure 7.15 is an alias of the 
true component at 7 = —1 of the lower spectrum in Figure 7.16. These spectral 
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Figure 7.16 The Discrete Fourier Series Spectrum for —N = n S (N — 1) 


components are being moved around by the shifting that occurs in the [MM] matrix as 
n, is changed, as described by Equation 7.91. 

Notice that with the almost symmetric choice for the range of n demonstrated by 
the spectra in Figure 7.16, the discrete Fourier series representation for the sampled 
values of cos t becomes 


te <= c eittnte 
5 =n 


= COS tk. (7.92) 


The RHS of Equation 7.92 is not only pure real and equal to cost at the sampled 
values, t = t, but is also pure real and equal to cost for all values of ¢! 


EXERCISES FOR CHAPTER 7 
1. Show that if f(1) is a periodic function with period T,, then 


F(t) = ft + 30). 


A Fourier series for f(t) can be constructed that uses w, = 27/3T,. Show that 
this is the same Fourier series that is generated using w, = 27n/To. 


2. Determine the coefficients of the Fourier series for the functions x(t), y(t), and 
z(t) shown below: 
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x(t) 


, 2(t) 


3. Consider the exponential Fourier series for f(x) with the form 


fO= DO eee. 


n=—0 
Answer the following questions without actually determining the Fourier coeffi- 
cients: 

(a) If f(x) = sin? x, what are the values for the w,? 

(b) If f(x) = sin? x, is Cg Zero Or nonzero? 

(c) If f(x) = sin? x, are the c, pure real, pure imaginary, or complex? 

(d) If f(x) = sin* (x — 7/2), are the Cc, pure real, pure imaginary, or complex? 
(e) If f(x) = sin? (x — 7/4), are the c, pure real, pure imaginary, or complex? 
Now determine all the exponential Fourier series coefficients for f(x) = sin? x, 


without doing any integrals. 


4. Determine the sine/cosine Fourier series that represents the following periodic 
function: 
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f(t) 


Determine the exponential Fourier series coefficients for this same function. 


. Find the period and coefficients for a Fourier sine/cosine series that represents 
the following nonperiodic functions over the specified range. Make a plot of the 
original function and the periodic Fourier series. 

ik fO=2P O<t<a7 

ii, fO=e O<t<27 

. Consider two Fourier series S(x) and 52(x). The first series S (x) represents the 


function y;(x) = x? over the range 0 < x < 1, and S>(x) is equal to the periodic 
function y2(x) where 


y2(x) = x O0<x<1 
yo(x) = yo(x +1) for all x. 


YQ) 


y2(Xx) 


ix 


(a) Make a labeled plot of S,(x) for all x. 
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(b) Let 


Si(x) = S- a,e** and So(x) = S> be, 


n=—e n=—2 


Determine the k,, and, without evaluating the coefficients, find a relationship 
between the a,, and b,,. 


(c) Now evaluate the a, and b, coefficients. 


7. Consider the periodic function f(t) = f(t + T,) where 


0 -T,/2<t<-7 
f= (cost -m<t<+7 
0 +a<t<+T,/2 


This function can be expressed as an exponential Fourier series 


wo= ce, 
(a) Make a plot of f(f) vs f. 
(b) What are the values for w,? 
(c) From your answer to part (a) determine Cy. 
(d) Are the c,, pure real, pure imaginary, or complex? 
(e) Determine the c,,. 


8. Consider the function 


f= > &e- nT), 


n=—o 


where n is an integer and 6(x) is the Dirac 6-function. 

(a) Make a labeled plot of f(t) vs t. 

(b) Determine the fundamental frequency and the coefficients a, and b, of the 
sine/cosine Fourier series such that 


S- 6(t ~ nT) = 2 + So an cos (@,t) + So bn sin (@,f). 
n=1 


n=—e n=] 


(c) The 6-function is highly discontinuous, and based on the discussion of 
convergence in this chapter, we should suspect some odd behavior with the 
Fourier series representation of this function. What is unusual about the 
coefficients you determined in part (b)? 


9. Evaluate 


T, 
| dt cos (w,t) e", 
0 


where w, = 2an/T, and n and m are both integers. 
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10. If a function can be expressed as a Fourier series 


2 


fo= >> cee 


n=—-@ 
and its derivative can also be expressed as a Fourier series 
df(t) . i0, 
SSS C 2, f ; 
dt De Ln€ 
n=—@ 
how are c, and C,, related? How are w, and 0, related? 


11. If g(t) is periodic with period T, and is represented by a sine/cosine Fourier series 
with coefficients a, and b,, determine the coefficients of the Fourier series that 
represents the function 


dg(t) | d*g(t) 


LOS ae ae 


Repeat this problem with g(t) represented by an exponential Fourier series. 
12. Consider the differential equation 


a + x(t) = f), 


where f(¢) is a periodic with period T, and in the interval 0 < t < T, is given by 


_f-1 0<t<T,/2 
fO=4 53 To/2<t<To ” Se, 


(a) Make a plot of f(r). 

(b) Determine the exponential Fourier series for f(t). 

(c) Assume x(t) can be expressed as an exponential Fourier series and find the 
coefficients. 

(d) What electronic circuit demonstrates the action of this differential equation? 
What type of electrical signals are represented by x(t) and f(t)? 


13. Consider the periodic triangular wave shown below: 


fo) 
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(a) Find the sine/cosine Fourier series for this function. 

(b) Find the exponential Fourier series for this function. 

(c) Assume that the function f(r) is the driving function for a damped harmonic 
oscillator, and the response of the oscillator is given by the function x(t) so 
that 

d?x(t) dx(t) 
+A—— + = f(t). 
m 77 A a kx(t) = f(t) 

Find the Fourier series for the response x(1). Is it easier to use your answer 

to part (a) above and find the sine/cosine series for f(t) or your answer to 

part (b) and find the exponential series for x(t)? 


(d) What happens to the response x(t) if T, = 2ar./m/k? 
14. Go through the arguments for developing the discrete Fourier series and show 


that the orthogonality conditions holds as long as the range of nisn, Sn = 
(2N + n, — 1), where 2N is the number of samples taken and n, is any integer. 


15. Consider the periodic function f(t) shown below: 


(a) Determine the exponential Fourier series for this function and plot the Cc, VS. 
n to describe its spectrum. 

(b) Plot the function df(t)/dt. Determine its Fourier series and plot its spectrum. 

(c) Develop the discrete Fourier series for this function by sampling it six times 
during its period. Plot the discrete spectrum and discuss the aliasing effects. 


16. The function f(t) is the periodic triangular wave shown below. 


2 f(t) 
1 
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17. 


18. 


19. 
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(a) Develop its discrete Fourier series by sampling it four times during a period. 
(b) Compare this discrete Fourier series to the one for a cosine function with the 
same period and sampled at the same rate. 


Let f(t) = ¢? for all t. 

(a) Find the exponential Fourier series, S(t) that represents this function over 
the intervalO <t < 1. 

(b) Develop the discrete Fourier series for the periodic function S(t) from part 
(a) above by sampling the function four times during its period. Compare 
the coefficients found for this discrete Fourier series with the actual ones 
found in part (a). 

(c) Now sample the function S(t) six times in one period and repeat the steps in 
part (b). 

(d) Discuss the aliasing for this problem. 


Consider a periodic function with 7, = 4, given in the range —1 < ¢ < 3 by 


1 -l<rt<+1 


= 2 
fo “dP 42-1 4+1<4< +3" es 
2 


(a) Make a labeled sketch of f(t). 

(b) Find the Fourier series that represents f(t). Note that d? f/dt? = g(t), where 
g(t) is a familiar function. Find the Fourier series for f(t) by relating it to 
the Fourier series for g(t). 

(c) Develop the discrete Fourier series for f(t) by sampling it four times during 
a period. Compare the coefficients of this discrete Fourier series to the 
coefficients obtained in part (b) above. 

(d) Now find the discrete Fourier series by sampling f(t) six times during a 
period. 


Let Sy be the partial Fourier series 
NY a 
Sv) = SO cei". 
n=~N 


The mean-squared error between this partial sum and the function f(t) is defined 
as 


1 
(ey) = 7 | at | f(t) - Sy(t))’. 
o JO 


This error can be looked at as a function of the 2N + 1 c, coefficients. Assume 
Sw(t) is pure real and minimize the error with respect to the c,, i.e., set 


HEY) _ 
dc, 


0, 


n 


EXERCISES 
to generate 2N + | equations for the c,. Show that these equations lead to 


1 [” ae 
C¢, = = dt f(pe '",” 
ra 


the standard expressions for the Fourier series coefficients. 
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FOURIER TRANSFORMS 


The Fourier series analysis of the previous chapter can only be used to obtain spectra 
for periodic functions. If the independent variable is time, periodic functions must 
repeat themselves for all time. But signals in the real world have both a beginning and 
an end, and consequently cannot be analyzed using Fourier series. There is, however, 
a continuous version of the Fourier series, called the Fourier transform, which can 
handle certain types of nonperiodic functions. In this chapter, we demonstrate how the 
Fourier transform is the limiting case of the Fourier series, as the period of repetition 
grows infinitely large. 


8.1 FOURIER SERIES AS T, — « 


The Fourier series for a periodic function, such as the one in Figure 8.1, can be 
summarized by the equation pair 


+00 
fO= Yo cele (8.1) 
1 pto/2 
Cy = dt f(tje'*", (8.2) 
T, J-1,/2 


where w,, = 2an/T,. To simplify the discussion that follows, the limits of integration 
have been shifted to be symmetric about ¢ = 0. The c, coefficients are the complex 
amplitudes of the discrete frequency components which make up f(t). 

As the period T, grows larger, as shown in Figure 8.2, the Fourier series still 
generates a spectrum, but the discrete frequencies w, = 2anT, become packed 
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f(t) 


-Tf2 T/2 
Figure 8.1 Standard Periodic Function 


closer and closer together. If T, is infinite, as shown in Figure 8.3, the function f(t) 
is no longer periodic. Equation 8.2 can no longer be used to generate a spectrum 
because of the 1/7, factor, and the Fourier series analysis breaks down. 

Functions like the one in Figure 8.3 can be analyzed with the Fourier transform, 
which can be derived by carefully considering the Fourier series equations in the 


f(t) 


-T,/2 T/2 
Figure 8.2 Periodic Signal with Increasing 7, 


f(t) 


Figure 8.3. Typical Nonperiodic Signal 
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limit as TJ, — oo. In this limit, the discrete w, frequencies merge into a continuous 
frequency parameter 


o, = —w (8.3) 
oO 
and we can treat Aw, the step size between the discrete frequencies, as a differential 
variable dw 


2 
Aw = = + do. (8.4) 
To 

To find the equations equivalent to the Fourier series pair given by Equations 8.1 
and 8.2 in the limit of T, — ©, insert Equation 8.2 into Equation 8.1 to obtain 


© 1 T,/2 
fO= > Bl Neds fee een (8.5) 


jae T>/ 


Notice that in this equation a dummy variable of integration 7 has been introduced 
to avoid confusion with the independent variable t. Rearranging Equation 8.5 and 
substituting 1/7, = Aw/27 gives 


2 T,f2 
f(t) = —elat / ss dt pine] ; (8.6) 


o 


As T, goes to infinity, a, —> w and Aw — dw, so the sum over n becomes an integral 
over w. Thus in this limit, Equation 8.6 becomes 


f(y = ao ‘2 dw e' E- is dt fine ; (8.7) 


where we have arbitrarily split up the factor of 1/27 in anticipation of things to 
come. 

The bracketed term on the RHS of Equation 8.7 is called the Fourier transform of 
Ff () and is a function of the continuous variable w. It is generally a complex function, 
so we represent it with the symbol F(w). With this definition for F(w), Equation 8.7 
can be broken into two equations 


1 ™ : 
Fi — dt ev iat 8.8 
Fw) = = fF are fe (8.8) 
Flt) = = iA dw e™ F(a), (8.9) 


which are often referred to as the Fourier transform pair. They are the continuous 
extensions of Equations 8.1 and 8.2 for nonperiodic functions. These equations are 
sometimes written using the operator symbol F: 


ORTHOGONALITY 253 


F(@) = Fif(p} (8.10) 
f= F-{E(@)}, (8.11) 
where 
f= = | dt ei (8.12) 
277 J—« 
and 
F=f doe (8.13) 
277 J—~ 


are both operators. 


8.2 ORTHOGONALITY 


The orthogonality condition associated with the exponential Fourier series, which 
allowed the coefficients to be evaluated, took the form 


To ; 
| dt e@n~ mt = TS, (8.14) 
0 


where w, = 2n/T, and n was an integer. A similar orthogonality expression exists 
for the Fourier transform. As long as the Fourier transform F(w) exists, Equation 8.7 
can be rearranged as 


fO= / ° dt f(t) E / dos et (8.15) 


Because the definition of the Dirac delta-function is 


00 


f®O= | drf(1)8(t — 7), (8.16) 


the bracketed term on the RHS of Equation 8.15 must act like a shifted 5-function: 


_ | dw e!'—” = &(t — 7). (8.17) 
277 Jo 


Interchanging the variables gives the inverted form: 


a ae 
— dt e@- © = §(w — w'). (8.18) 
27 J_« 


These two equations are the orthogonality conditions we seek. A function e! is 
. —Fay!, . 
orthogonal to all other functions e“’' when integrated over all t, as long as w # w!. 
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The relations in Equations 8.17 and 8.18 cannot be viewed as rigorously correct, 
because they involve a 5-function which is not enclosed in an integral. The relations 
should be viewed as a shorthand notation and are mathematically valid only if they 
appear inside integral operations. 

The orthogonality relations given in Equations 8.17 and 8.18 can be used to gen- 
erate one of the equations of the Fourier transform pair (Equations 8.8 and 8.9) from 
the other. Operating on Equation 8.8 with { dw e'*” and then applying Equation 8.17 
gives 


co ; 1 00 eo) ; 
/ dw e'*"F(w) = == | dt f) i. dw ee 
—7 Qa —o —« 


foe) 


1 
? V 27 J—-~ 
= /2af(r), (8.19) 


which is simply Equation 8.9. The same process works in reverse by operating on 
Equation 8.9 with f dt e~ and using Equation 8.18. 


dt f(t)276(t — 1) 


8.3 EXISTENCE OF THE FOURIER TRANSFORM 


In order for the Fourier transform to exist, we must be able to perform two processes. 
First, the transform integral itself must exist. That is, given an f(t), we must be able 
to calculate F(w) from Equation 8.8. Second, when we apply the inverse transform, 
we must retrieve the original function. In other words, after calculating the F(w), we 
must be able to apply Equation 8.9 and get back f(t). 

In order for the Fourier transform of a function, f(t), to exist, it must be possible 
to perform the integration, 


/ dt e F(t). (8.20) 
One simple condition derives from the fact that 
le | = 1, (8.21) 
SO we can write 
2 dt e~™ F(t) 


Thus a sufficient condition for the existence of F(w) is that the integral on the RHS 
of Equation 8.22 be finite. That is, 


= [ dt |f (2). (8.22) 


/ ra Ifo << (8.23) 


Functions which obey Equation 8.23 are called absolutely integrable. 
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The second issue, whether the inverse transformation returns the appropriate orig- 
inal function, is more subtle. If F(w) exists, it can be substituted into Equation 8.9 to 
give Equation 8.15. Then by the orthogonality condition of Equation 8.17, it would 
appear that if F(w) exists, the inversion must return the original f(t). This indeed 
will be the case for all well-behaved, continuous functions. But the sifting action of 
6(t — T) is only really well defined for functions that are continuous at t = 7. To 
determine what happens when the function is discontinuous at t = 7 takes a little 
more thought. 

One way to look at this situation is to substitute a sequence function 6,(t — +) for 
the bracketed term in Equation 8.15 and then take the limit as n —+ «, The Fourier 
inversion equation then becomes 


lim i dt f(t)8,(t — 7). (8.24) 


If a Gaussian sequence function is used, the functions in the integrand can be graphed 
as shown in Figure 8.4. From this figure, it is clear that in the infinite limit we can 
write 


ft-0)+ftte 
5 ; 


If f(t) is a continuous function, then the RHS of Equation 8.25 becomes f(t) and the 
Fourier inversion returns the original function. If the function f(t) is discontinuous 
at some point, say ¢ = fo, the inversion returns the average value of f(t): 


lim / dt f(7)8,(t — 7) = lim (8.25) 
n— 00 —o Ee 


lim / afORe-n= ATS, (8.26) 


where f; and f, are the values of f(t) just to the left and right of the discontinuity, as 
shown in Figure 8.5. 

Depending upon how f(t) is defined at the discontinuity, this may or may not be 
the correct value for f(t = t,). For example, if f(t) were defined as 


f@= 1 ee (8.27) 


f(t) 


Figure 8.4 Integrand Functions for the Fourier Inversion 
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oO 


Figure 8.5 Fourier Inversion at a Discontinuity 


so that f(t,) = fj, the Fourier inversion returns the incorrect result given in Equa- 
tion 8.26. This same effect was seen in the discussion of the convergence of Fourier 
series in the previous chapter. 


8.4 THE FOURIER TRANSFORM CIRCUIT 


The expression for f(f) in terms of its Fourier transform F(a), 
fO= as / dw e' F(a), (8.28) 
V 27 J-o 


implies that the function f(*) can be made up of a continuous sum of oscillators. This 
can be seen by considering the integral in Equation 8.28 as the limit of a Riemann 
sum. Remember that a Riemann sum views an integral, such as the integral of g(x) 
over all of x, to be the limit of a sum of rectangular areas, 


if dx g(x) = Jim, s g(nAx) Ax, (8.29) 


n=—o0 


where Ax is the width of each rectangle. Applying this expression to Equation 8.28 
gives 


FQ) = lim, a : ie ei wy, (8.30) 


Equation 8.30 represents a sum of terms, each oscillating as e"4®", with an amplitude 
given by AwF(nAo)/ J2t. Similar to what we did in the previous chapter with 
Fourier series, we can view this sum as the electric circuit shown in Figure 8.6. The 
value of f(t) is the sum of the voltages produced by all the oscillators. As Aw goes 
to zero, both the spacing between the frequencies of the oscillators and the amplitude 
of each oscillator gets infinitesimally small. The sum, however, combines to form the 
finite value of f(#). 


THE FOURIER TRANSFORM CIRCUIT 
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Figure 8.6 The Fourier Transform Circuit 
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8.5 PROPERTIES OF THE FOURIER TRANSFORM 


In this section, a number of mathematical properties of the Fourier transform are 
developed. Each derivation involves either one or both of the functions f(t) and g(t), 
which we will assume possess the valid Fourier transforms: 


FIFO) = F(a) (8.31) 
Fig()] = Glo). (8.32) 


8.5.1 Delay 


The Fourier transform of f(t — f,), which is f(1) delayed by t,, can be expressed in 
terms of F(w). The Fourier transform of the delayed function is 


1 < : 
FL — 11 = ee i dt eft — 1). (8.33) 
V 2m J-x 
Make the substitution a = f — f, to obtain 


FUG - 1%) = sal da e ielate) #(q) 
TT J—@ 


ae eile - i da e F(a) 
vV WT Ja 


= e % FL f(a]. (8.34) 


So when a function is delayed by an amount f,, its Fourier transform is multiplied by 
a factor of e7 i, 


8.5.2 Time Inversion of Real Functions 


The Fourier transform of f(—1), if f(t) is a real function, is the complex conjugate 
of F(w). To see this, start by writing the Fourier transform of the inverted function as 


FIf(-)] = at e™ F(—1). (8.35) 
sal 
Substitution of a = —tf gives 
Fi ol = veh da e'”* f(a) 
“da el! F(a), (8.36) 


-sEl 


with the result that, if f(z) is pure real, 
Fif(—1)] = E"(@). (8.37) 
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8.5.3 Even and Odd Functions 
If f(t) is a real, even function then 

f(-) = fO, (8.38) 
and by Equations 8.31 and 8.37 

F*(@) = F(). (8.39) 


Thus, for even functions, F(@) must be a pure real function of w. 
If f(t) is a real, odd function then 


f(-) = -fO, (8.40) 
and by Equations 8.31 and 8.37 
F*(w) = —F(@). (8.41) 


Therefore, for odd functions, F(w) must be a pure imaginary function of w. 


8.5.4 Spectra for Pure Real f(é) 
Functions which are pure real have transforms which obey an important symmetry. 
This can be derived easily by substituting —w into Equation 8.8 to give 


1 a 
es | apes): 8.42 
ag bec te f(t) (8.42) 


Consequently, if f(t) is pure real then 


F(-@) = 


F(—@) = F*(@). (8.43) 


This means that the negative frequency part of the spectrum can always be obtained 
from the positive frequency spectrum. The negative frequency part of the spectrum 
contains no new information. 


8.5.5 The Transform of df(t) /dt 


The Fourier transform of the derivative of f(t) can be expressed in terms of the 
transform of f(t). To see this, start with the transform of the derivative: 


df _ 1 - ~ ion F 
aca = Jin gees ae (8.44) 


The RHS of Equation 8.44 can be integrated by parts. Let u = e-' and dv = 
(df/dt) dt. Then 


F B = m fem sol, = i: dt Cine" pe}. (8.45) 
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If f(t) — 0 ast —+ +, which must occur for any function with a valid Fourier 


transform, then 
df|_ iw Bs ae 


= iwF(w). (8.46) 


8.5.6 The Transformations of the Product and Convolution 
of Two Functions 


The Fourier transform of the product of two functions can be expressed in terms 
of the Fourier transforms of the individual functions. The Fourier transform of the 
product of f(t) and g(t) is 


1 ~ ; 
F [feo] = = [ae sosw, (8.47) 
TJi-0 | 


To express this transform in terms of F(w) and G(w), insert the inverse transforms for 
F(® and g(t) into Equation 8.47, making sure to use different variables of integration 
to keep all the integrals independent: 


FU (Og(} = a | at e™ x a i dws! e*"'G(w') 


x a i dw" el" E(w"), (8.48) 
V 27 J-« 


Now collect all the 1/1/27 factors, and combine the integrals over t, w’ and w” into 
a three-dimensional integral. The RHS of Equation 8.48 becomes 


3 
1 ise] [>] ise] 7 ; 4 
= dt | do! i dos" [G(w')F(w" eo Fo" 7), (8.49) 
= | a de —0 oo 


The order of integration is arbitrary. If we choose to do the r integral first, we are 
confronted with the integral 


/ dt elo’ +o"-o — 278(w!' + w" — w), (8.50) 


which is just the orthogonality condition for the Fourier transform. Using this in 
Equation 8.49 gives 


= ‘: du fe dw" [G(w')F(w")8(w! +o" — w)|. (8.51) 
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Performing the integration over dw” produces the final result 
] CO 
FBO} = a [ 40! Go) Bw ~ (8.52) 
WT Jw 


The operation on the RHS of Equation 8.52 is a common one and is called the 
convolution of G(w) and F(w). Convolution plays an important role in the application 
of Green’s functions for solutions of differential equations, which is discussed in a 
later chapter. The convolution of G(w) and F(w) is often written using the shorthand 
notation 


G(o) @ E(w) = / duo! G(w)F(o — 0). (8.53) 


From the form of Equation 8.53, it is clear that 
G(w) & F(@) = F(@) ®& G(w). (8.54) 


Due to the symmetry of the Fourier transform pair, it is clear that the Fourier 
inversion of the product of two transforms will also involve convolution: 


1 


“MF (w)G = 
F {F(w)G(w)} Ja 


f(t) © a) (8.55) 
or 


FASO © gO} = V27 F(w)G(o). (8.56) 


Convolution can be somewhat difficult to visualize. The process is facilitated by a 
graphical interpretation demonstrated in the following example. 


Example 8.1 Consider the convolution of two functions: 


A(t) = f(t) @ at) = / dt f(t)gt — 7) (8.57) 


Let f(¢) be a square pulse and g(¢) a triangular pulse, as shown in Figure 8.7. The value 
of A(t) at a particular value of ¢ is equal to the area of the integrand of Equation 8.57. 


f(t) i g(t) 


1 2 3 


Figure 8.7 Example Functions for Convolution 
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f) g(t-t) | 


1-4 t-2 t 


Figure 8.8 Convolution Functions Plotted vs. + 


This area can be represented graphically as the area under the product of f(7) and 
g(t — 7) plotted vs. r. The functions f(r) and g(t — 1) plotted vs. + are shown in 
Figure 8.8. The function g(t — 7) plotted vs. 7 is the function g(7) inverted and shifted 
by an amount ¢. In Figure 8.8, 9(¢ — 7) has been plotted for a negative value of t. 
The integrand of the convolution integral is a product of these two functions, and the 
value of the convolution is the area under this product. 

To determine h(t), we must consider this process for all possible values of t. 
For this example, start with t < 0. A combined plot of f(r) and g(t — 7) for this 
condition is shown in Figure 8.9, which clearly indicates there is no overlap of the 
two functions. Their product is therefore zero, and h(t) is zero for this value of t. 
In fact, from this picture it is clear that h(t) = O for all t < 1 and all t > 7. The 
convolution of these two functions can be nonzero only in the range 1 < t < 7, 
where a little more analysis is needed. Within this range there are three subregions 
of t to consider: 1 < t < 2;2 < t < 6; and 6 < t < 7. The nature of the overlap 
between the two functions is different in each case. 

For 1 < t < 2, the functions of Figure 8.9 only overlap in the range 1 < 7 < 1, 
as shown in Figure 8.10. Therefore for 1 < t < 2, 


t 
A(t) = fO @ a = | dr f(t)g(t — 7). (8.58) 


Over the integration range f(T) = 2. The function g(t — 7) is given by 


(2/5(t~-~ 7) O<(t-—7T)<5 
0 otherwise : 


Gupel 


g(t-T) we 
/ 


Increasing t 


t-4 t-2 t ' 1 2 


Figure 8.9 Convolution Functions for t < 0 
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f(t) g(t-t) 


g(t-t) {9 


' J ¢ 2 


Figure 8.10 Convolution Picture for 1 < t¢ < 2 


The range 0 < (t — r) < 5 is equivalent to (t — 5) < r < t. Therefore, over the 
range of the convolution integral in Equation 8.58, g(t — 7) = (2/5)(t — 7), and 


A(t) = f() ® g(t) = i dr(4/5\t—-7r) 1<t<2. (8.60) 
1 


The result is 


h(t) = f(t) @ g(t) = =f = a + ; 1<t<2, (8.61) 
Up to this point, A(t) has been obtained for the ranges ~~ < ¢ < 2 and7 < tf, as 
shown in Figure 8.11. 

For 2 < ¢ < 6, the triangle function is shifted further to the right, so that the 
square pulse is totally inside it, as shown in Figure 8.12. The convolution integral in 
this region becomes 


2. 
h(t) = | drfingt-)  2<1<6. (8.62) 
1 
“ho 
oe Fee 
5 ‘ 
Se 
°12 7 


Figure 8.11 Convolution Result for —© <¢<2and7<t 
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f(t) g(t-D) 
\ 


Figure 8.12 Convolution Picture for2 <t<6 


As before, f(7) = 2 and g(t — t) = (2/5)(t — 7), so Equation 8.62 integrates to 


BSS 


A(t) = 2<t<6. (8.63) 


5° 5 
The convolution function A(t) has now been determined for the ranges —© <¢ <6 
and 7 < t as shown in Figure 8.13. 

For the last region, 6 < t < 7, the triangular pulse has moved to the right so only 
its trailing edge is inside the square pulse, as shown in Figure 8.14. The convolution 
integral for this range becomes 


2 
h(t) = | IDE D C27 ST. (8.64) 
1-5 


Again, f(7) = 2 and g(t — 7) = (2/5)(t — 7), so that the integration evaluates to 


Op oe AD 
= opt itt <t<7. : 
A(t) sf 5! 5 6<t<7 (8.65) 


12 7 


Figure 8.13 Convolution Results for —-0 <¢<6and7<+t 
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f(t) g(t-t) 


2 
5 
Figure 8.14 Convolution Picture for 6 < t <7 


6 hit) 
5 


"1267 


Figure 8.15 Convolution Results for —9% < t < 0 


The final result for the convolution of f(t) with g(t) is shown in Figure 8.15. 
As you can see from this example, a good deal of care must be taken to perform a 
convolution integral properly! 


8.5.7 Correlation 


The correlation process performs a measure of the similarity of two signals. It has 
several important practical applications. The most important is its ability to pick a 
known signal out of a sea of noise. The cross-correlation between f(t) and g(t) is 
defined as 


Ie) 


Welt) = i, dt f(r)g(t — t). (8.66) 


The cross-correlation of a function with itself, 


Wry(t) = i dt f(r) f(t — 1), (8.67) 
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f(t) 


g(t-t) Increasing t cA 


t t+ t+3 +5 he Us: 
Figure 8.16 The Cross-Correlation Operation 


is called an autocorrelation. It is frequently used to measure the pulse width of fast 
Signals. 

The correlation operation is quite similar to convolution described in the previous 
section, except the second function is not inverted. The t-variable shifts g(r) with 
respect to f(r) so that the functions of the integrand of the correlation operation 
appear as shown in Figure 8.16 if f(t) and g(t) are as defined in Figure 8.7. In 
general, the correlation operation requires the same care in breaking up the integral 
as is necessary for convolution. 

The Fourier transform of the cross-correlation of two functions can be expressed 
in terms of the individual Fourier transforms of the two functions. The derivation 
follows the same steps used for convolution, and the result is 


F { / “aioe= o} = /Im F(w)G(-). (8.68) 


8.5.8 Summary of Transform Properties 


The properties of the Fourier transform are summarized in the table below. In this 
table a double arrow < is used to indicate the transform and its reverse process. 


AG) oad F(w) 
a(t) o> G(w) 
f(t — to) o ete F(a) 
df(t)/dt + iwF(w) 
fg(t) & (1/27) @ Go) 
fO@a = 29 F(w)G(w) 
fC.dt fie(r—t) 3 — V2mE(w)G(-) 
f(-t) ~ F*(w) 
f(-) = fO + F*(w) = F(w) 
f(-)=-f® @ EX) = -E) 
f@) = f*® o F(-@) = F*(w) 


The last four relations are true only if f(t) is a pure real function of t. The others are 
valid even if f(t) and g(t) are complex functions. 
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8.6 FOURIER TRANSFORMS—EXAMPLES 


This section derives the Fourier transforms of several common functions. In each 
case, we have attempted to explore an important technique or principle associated 
with the Fourier transform. 


8.6.1 The Square Pulse 


Consider the square pulse shown in Figure 8.17 with 


_ fi —-T/2<t<T/2 
f= ‘a otherwise ; (8.69) 
Clearly 
/ dt |f()| = T <@, (8.70) 


so this function is absolutely integrable and should have a valid Fourier transform. 
Using the definition of the transform, 


T/2 


Fal 
2a J-T/2 


This integration is straightforward and evaluates to 


1 \ (-e7i) 1h 
V20 ( iw ) eas 
2 /2 jae >) (8.72) 


This spectrum is plotted in Figure 8.18. It sits inside an envelope given by \/2/(aw7). 
As w — 0, it has the finite limit of T/ J/2n, because both the numerator and 
denominator in Equation 8.72 are going to zero at the same rate. The first zero of 
F(w) occurs when sin(wT/2) = 0 or at w = (277)/T. The other zeros are periodic at 


F(w) = dt ei", (8.71) 


F(@) 


Hl 


f(t) 


—— 1 


-T/2 . T/2 
Figure 8.17 The Square Pulse 
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Figure 8.18 Spectrum of a Square Pulse 


« = 2amn/T where n is an integer. This function is so common that it has been given 
its own name: 


SIN = sinc x. (8.73) 
x 


We can easily find the area under any F(w) spectrum by evaluating the inverse 
Fourier transform of Equation 8.9 at ¢ = 0: 


/ lies F(w) = V/27f(0). (8.74) 


—o 


Therefore, for the square pulse 


‘ dw F(w) = V2. (8.75) 


2 


The area under the spectrum of the square pulse is finite and independent of T. 

This has interesting consequences for the spectrum in the limit of T — ©, From 
Figure 8.18, notice that as T increases, the value of the spectrum at w = 0 increases, 
and all the zero crossings move toward w = 0. This happens in such a way that 
the area under the spectrum remains constant. In the limit of T — ©, therefore, 
this spectrum has all the qualities of a Dirac 5-function. This can be confirmed by 
combining the formal expression for F(w) with the orthogonality condition derived 
in Equation 8.17: 


= V 2778). (8.76) 
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Remember, however, that this is not a rigorous equation, because the 6-function is 
not inside an integral. The Fourier transform for the function f(t) = 1, is not really 
defined, since 


F(w) = / dt ei (8.77) 


does not converge. But in a casual sense, the 6-function in Equation 8.76 makes 
perfect sense, because when it is inserted into the inverse Fourier transform, we get 
back our original function f(t) = 1: 


re iz dw e \/278(w) 


=) 


dw e' F(w) = 


Yaal. 


We will explore the transform properties of the 5-function in greater detail in the 
following sections. 


8.6.2 Transform of a 6-Function 


In the light of the previous example, it is appropriate to ask for the Fourier transform 
of 5(t). Again, we will suspend the rigorous treatment of the 5-function and allow it to 
exist outside of integral operations. The 5-function is certainly absolutely integrable, 
because 


/ at |f(Ol = / dt 8(t) = 1, (8.78) 
and the transform itself is easy: 


F(w) = dt e (S(t) 


=, (8.79) 
V2 
The inversion of this transform function becomes 
| elt 
—= dw ——, (8.80) 
Vv 277 J-o Vv 21 


which, using Equation 8.17, can be interpreted as 6(t). 

The Fourier transform of a 6-function can be used to obtain the transform of a 
square pulse. If f(t) is the square pulse shown in Figure 8.17, its derivative is the sum 
of two 6-functions 


te ” = &(t + T/2) — &(t —T/2). (8.81) 


270 FOURIER TRANSFORMS 
The transform of these two shifted 5-functions, using Equation 8.34, is 


F{8(t + T/2) — &(t — T/2} = ae — ¢ eT /2), (8.82) 


Because this is the transform of the derivative of the square pulse, Equation 8.46 says 
the spectrum of the square pulse is given by 


[eiet/2 ~ iw /2) 


F(@) = 


—e 
iw/ 27 


_ V2 sn(or/)} a 
T w > : 


which agrees with the result we found earlier in Equation 8.72. 


8.6.3 Transform of a Gaussian 
A normalized Gaussian pulse has the general form 


242 


ft) = eer”, (8.84) 


Va 


where the leading constant normalizes the function, as we will prove shortly. A graph 
of Equation 8.84 is shown in Figure 8.19. Notice how the parameter a scales both 
the width and the height of the pulse. 

The area of the pulse is given by 


Area = / Viagra (8.85) 
—o 7 


Ti 


lla. 
Figure 8.19 The Gaussian Pulse 
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and can be evaluated with a standard trick. First, make a substitution of x = at: 


Area = —— af. dxe*. (8.86) 


Now consider an equation for the square of the area: 


=f ave? [ dye” 
-if- ax f dy e+). (8.87) 


Notice the final integral of Equation 8.87 can be viewed as an integral over the entire 
Cartesian xy-plane. By changing the variables to a polar form with the substitutions 
x° + y? = p? and dx dy = p dp dO, Equation 8.87 becomes 


[Area]” 


oo 20 
[Area]? = < [ dp ; dO pe’. (8.88) 


The d@ integration simply gives a factor of 27, and the dp integration that remains is 
straightforward. The final result is 


[Area]” = 1. (8.89) 


Therefore, 
/ dt “Le? =], (8.90) 
00 Ja 


The area under the normalized Gaussian is one, independent of the value of a. 
The Fourier transform of the normalized Gaussian is 


dt mee i 


F(o) = 


JV lH 
a 2,24; 
a dt ee Fiat) (8.91) 
am/2 ie 


This integration can be evaluated by completing the square. A quantity y is added 
and subtracted from the argument of exponential function, 


(a7? + iwt + 7) —-y, (8.92) 


so that the expression in the brackets of Equation 8.92 forms a perfect square. That is, 
we want to be able to write a1? + iwt + yas (at + B)*. The value of B is therefore 
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determined by 
a?t? + iwt + y = (at + BY 
= at? + 2aBt + B’. (8.93) 
From Equation 8.93, 
2aBt = iwt (8.94) 
so that 
p= (8.95) 
2a 
and 
2 
2 @ 
= p* = -—. 8. 
y=B 402 (8.96) 
Equation 8.91 can now be written as 
F(w) = Sze 40") i dt en lartio/ ay? (8.97) 
> a2 —o0 


This integration is accomplished using the substitution x = at + iw/(2a) and 
dx =a dt: 


1 2 2 a 2 
F(@) = e740") dxe* 
arV/2 2% 


1 
= eo 14a?) (8.98) 
V2 


Thus we have shown the Fourier transform of a Gaussian is another Gaussian: 


F| ae 1 


= ee Nhe"), (8.99) 


V2 


As q@ increases, the width of the Gaussian pulse decreases, while the width of 
its Gaussian spectrum increases, as shown in Figure 8.20. This demonstrates an 
important general property of the Fourier transform. The width of the function f(t) 
is always inversely proportional to the width of the transform F(w). In the case of the 
Gaussian, the width of f(t) is roughly 


At 


R 


, (8.100) 


while the width of F(w) is approximately 
Aw = 2a. (8.101) 
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f(t) 
F(@) 
ols 
0) 
—_—_—C_—_—_—oOOCOCO 
20 
Figure 8.20 The Gaussian and its Fourier Transform 
F(@) 
edhe 
J2n 
<——__> — 
@ 
Figure 8.21 The Gaussian and its Fourier Transform in the Limit a — 
Thus the product of the two is a constant: 
AwAt = 2. (8.102) 


In general, for any functional shape, you will find that A wAr ~ c, where c is aconstant 
determined by both the functional form and the precise definition of the “width” of a 
function. In the wave theory of quantum mechanics, this is the mathematical basis of 
the Heisenberg uncertainty principle. 

The Gaussian can also be used to obtain the Fourier transform of the Dirac 6- 
function by looking at the limit of the Gaussian as a — ©, In this limit, as can be 
seen from Figure 8.21, the Gaussian goes to 6(¢) and its Fourier transform goes to 


V/V2n. 


8.6.4 Periodic Functions 


There is no formal Fourier transform for a periodic function, because the integral of 
Equation 8.20 cannot be performed. This limitation vanishes, if you are willing to 
allow Dirac 6-functions to appear outside of integral operations. 
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As an example, consider the function f(t) = sint. Its Fourier transform is given 
by 


_ i is —iwt i it pit 
AG Laie E (e e | 


Se Ser ae it(—w+1) it(—@-1) 
dt [e —e | 


7 Vim 2 00 


-i/% [5(@ + 1) -— 6 — Dj], 


where we have made use of the informal orthogonality condition, Equation 8.18. 
Figure 8.22 shows this transform pair. 

Fourier transforms which contain 6-functions, such as Equation 8.103, can be 
viewed as an alternative way of expressing the traditional Fourier series. Essentially, 
the 6-function converts the inverse Fourier integral into a Fourier series summation. 
For example, the inverse Fourier transform of the sine function is 


= "doe {in/Z [aw + 9 - dw ~ v)} 
7 J-0 


(8.103) 


2i 
= > cei, (8.104) 
where, for this function, all the c,, are zero except forc, = 1/(2i)andc_, = —1/(2i). 


This function has a periodicity of T, = 27 and so w, = n. This is simply the Fourier 
sum for the sine function. 


f(t) = sin t 


F(@) = ivn/2 [5(@+ 1)-8(@ -1)] 


Figure 8.22 Fourier Transform of sint 
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Consequently, if you are willing to consider Dirac 6-functions outside of integral 
operations, the Fourier transform of periodic functions can be taken. In general, this 
transform will be a series of 5-functions which, when inserted into the inversion 
integral, generates the normal Fourier series. With this interpretation, a separate 
formalism for the Fourier series is unnecessary, because all the properties of the 
Fourier series are included in the Fourier transform! 


8.6.5 An Infinite Train of 6-Functions 


An interesting transform to consider is that of an infinite train of 6-functions: 


fi) = SS 8 - nT). (8.105) 


n=—% 


Because this is a periodic function, its Fourier transform must be composed of a sum 
of 5-functions. This is, however, a fairly difficult function to transform and must be 
done in steps. 

We begin by taking the Fourier transform of a finite train of 6-functions: 


S(t) = y 8(t — nT,). (8.106) 


This function is absolutely integrable in the sense of Equation 8.23, and thus has a 
legitimate Fourier transform: 


i 


1 loa) : Mo 
—= dte 5(t — nT, 
V2T I. 2 2 eee 


Te. a8 dt e™8(t — nT) (8.107) 


Fn,(w) 


i 


i 


1 Ga | 
Se e iwnT> ; 
V 20 n=—Mo 

Because fj,,(¢) is a real, even function of ¢, its transform is pure real for all values 
of w. 

The frequency dependence of this Fourier transform can be interpreted graphically. 
The transform is a sum of phasors, each of unit magnitude and at an angle of — wnT, 
in the complex plane. One such phasor is shown in Figure 8.23. With w = 0 all the 
phasors are at an angle of zero radians, as shown in Figure 8.24. From this figure, it 
is clear that the result of the sum is a single phasor given by 


2m, +1 


ae 


F,,(@ = 0) = (8.108) 
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imag 


(2m,+ 1) 
Figure 8.24 Fourier Transform for a Finite Train of 5-Functions Evaluated at w = 0 


As w increases from zero, the phasors in Figure 8.24 begin to curl up. The phasor for 
n = Oisstill at zero radians. The other phasors at +n are at angles of + wnT,. Because 
of the symmetry, the result is still a single phasor with a phase of zero radians, but 
its magnitude has decreased, as shown in Figure 8.25. When w = 21/Tp, all the 
phasors are again aligned at zero radians so that F,, (w = 27/T,) = F,,,(@ = 0). 
The Fourier transform repeats this maximum when wT, is any integer multiple of 277. 

A closed form expression for F,,, (@), for arbitrary values of w, can be obtained 
graphically. To see this consider a simplified version of the phasor sum 


n=tmp, 


a eit (8.109) 


Figure 8.25 Fourier Transform for a Finite Train of 6-Functions as w Increases from Zero 
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Sin[(2m,+ 1)x/2] 
Sin(x/2) 


Figure 8.26 Phasor Summation 


Equation 8.109 is a sum of 2m, + 1 phasors, each one of unit magnitude and at 
an angle of x radians relative to its predecessor in the complex plane. Figure 8.26 
shows a geometric method for evaluating this sum. A point P is chosen which is 
equidistant from every intermediate point on the diagram. This distance is easy to 
determine, because the length of segment AB is one, and the angle 2 APB is x. This 
makes AP = 1/{2sin(x/2)}. Then, because angle ZAPJ = (2m, + 1)x, segment 
AJ, which is the sum we seek, is given by 

a= 3 zine — Sin [xQ2mp + 1)/21 


sin(x/2) Cre) 


n=—m, 


Putting the result of Equation 8.110 back into Equation 8.107 shows that the Fourier 
transform of a finite train of 5-functions is given by 


1 sin{w7,(2m, + 1)/2] 
J2T sin(wT, /2) 


Figure 8.27 shows a graph of this function for m, = 3. 

We obtain the transform of an infinite train of 5-functions by taking the m, — © 
limit of Equation 8.111. Notice, as m, increases, the height of each peak in Figure 8.27 
increases, while the distance between the peak value and the first zero shrinks. The 
distance between peaks remains a constant. The frequency of the oscillations in 
between these peaks increases without limit, so their effect cancels out inside of any 
integral, much like the sinc-sequence function described in Chapter 5. 

It looks like we have all the characteristics of an infinite train of 6-functions! In 
order to validate this, we need to evaluate the integral area of one of the peaks. This 


F,,(@) = 


(8.111) 


278 FOURIER TRANSFORMS 


Peak Value = (2m. +1)/ V2n 


= 


E,(@) 


fm,=t (1) 


lalate 
WOR een ewommenane 
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Figure 8.27 Fourier Transform for a Finite Number of 5-Functions 


is given by 


a/T > : 
pore doo sin[w7,(2m, + 1)/2] 
MeO J /T, of 27 sin(wT, /2) 


An easier method than grinding through the integral is to consider the triangular area 
shown in Figure 8.28. The magnitude of this area is J270/ T,. AS my increases, the 
area of the triangle is a better and better approximation to the area of the integral. In 
the limit, they are the same. 

Thus, the Fourier transform of an infinite train of unit area 6-functions separated 
by At = T, is another infinite train of 5-functions of area J20/ T,, separated by 
Aw = 2n/To: 


r s dt — ato} = vin > 5(@ — 27n/T,). (8.113) 


To 


(8.112) 


n=—0 n=-~00 


T,(2m,+ }) 
Figure 8.28 Triangular Solution for the 6-Function Area m, 
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Area = 1 f(t) 


To 2 


To 


Figure 8.29 Fourier Transform of an Infinite Train of 5-Functions 


This is shown in Figure 8.29. Notice that as the 5-functions get closer together in ¢, 
the 8-functions of the spectrum move farther apart. 


8.6.6 Transform of a Periodic Function Using Convolution 


The transform of an infinite train of 6-functions can be coupled with convolution to 
find the Fourier transform of any periodic signal. Consider the function h(t), which 
is a periodic train of square pulses, as shown in Figure 8.30. The pulse width is f,, 
che pulse height is 1, and the distance between the start of one pulse and the start of 
‘he next pulse is T,. Because h(t) is a periodic function, its Fourier transform H(w) 
must be composed of 5-functions and can be obtained using a Fourier series integral. 
However, h(t) can also be looked at as the convolution of two other functions 


Ait) = ft) @ gs), (8.114) 


where f(t) is a single square pulse of height 1 and width t,, centered at ¢ = 0, and g(t) 
is an infinite train of unit area 6-functions separated by T,, as shown in Figure 8.31. 

Because h(t) is the convolution of f(t) with g(t), its Fourier transform is just 20 
times the product of F(w) and G(w). These Fourier transforms have already been 
determined in previous examples: 


F(w) = 2 eee) (8.115) 


h(t) 


Figure 8.30 Periodic Train of Square Pulses 
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f(t) Area = | g(t) 


t 


° 


Figure 8.31 The Signals f(t) and g(t) which Convolve to Form A(t) 


G(w) = (8.116) 


n=—o0 


Consequently, 


H(w) = 3 5(w — 2an/T,). (8.117) 


n=—o0 


20m [sel 


This is an infinite train of 6-functions with a 6-function located at every @ = 2an/T,, 
each with an area weighted by the sinc function. If it is assumed that the pulse width 
is small compared to the period, i.e., t, < T,, the spectrum for this function looks as 
shown in Figure 8.32. 


8.6.7 Periodic Burst 


In the real world, there is no such thing as a periodic signal going on for all time. Real 
Signals must start and stop, even if they look periodic over a long interval of time. 


H@) 


Figure 8.32 Spectrum for a Train of Square Pulses 
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Figure 8.33 A cos(w,t) Burst 


The spectrum for signals of this type can be obtained using convolution techniques. 
Consider the function 


. _ | COS(Wpof) —tp<t<t, 
so \e otherwise (18) 


shown in Figure 8.33. We could calculate the Fourier transform of this signal using the 
standard Fourier integral. The transform is more easily obtained, however, by realizing 
that A(t) is the product of two functions whose transforms are already known. That 
is, h(t) = f(Hg(t), where f(£) is a square pulse of width 2, and unit amplitude, and 
g(t) is cos(@,t). We already know the transforms of these two functions are 


F(w) = V2 sinter) 


G(@) = Vz [5(@ — w,) + 5(w + @)] . (8.119) 


Because h(t) = f(t)g(), 

1 
V2 

i iI a 
V2 J-« 
Figure 8.34 shows a diagram of this convolution operation. 

In this convolution, G(@ — w’) is the sum of two 6-functions. This makes the 
convolution integral relatively easy, because the 5-functions will just sift out the 


value of F(w) where they overlap. If the expressions for F(@) and G(@) are inserted 
into Equation 8.120, the sifting property of the 5-functions results in 


1 sin[(@ + @p,)to] é sin[(@ — @,)to] 
J/2n @ + Wo @ — Wo , 


H(@) F(@) © Go) 


dw! F(w')G(w — o’). (8.120) 


H(w) = (8.121) 
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Figure 8.34 Convolution Picture for the Burst Function 


h(t) 


Figure 8.35 Transform of a Burst Signal 


as shown in Figure 8.35. Notice how each 6-function creates a shifted version of 
F(q@), and the two are summed to give H(w). If w, is much larger than 277/t,, the two 
copies of F(w) are separated and are essentially distinct. On the other hand, if , is 
on the order of 27r/t,, there is significant overlap. 

This transform has some interesting limits. One of these limits is when ft, — ~. 
In this case, the burst signal becomes more like an ideal periodic cosine signal. As 
you would expect, the pair of sinc functions in the transform become more like 6 
functions. Figure 8.36 depicts this limit. A second interesting limit is when w, — 0. 


h(t) 
H(w) 


Figure 8.36 Burst Transform as f, —> © 
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h(t) 


Figure 8.37 Burst Transform as w, —> 0 


In this limit, A(t) essentially becomes a square pulse, because cos(w,t) -> 1. As you 
would expect from Equation 8.72, the transform approaches a single sinc function 
centered at w = 0. This is shown in Figure 8.37. 


8.6.8 Exponential Decay 


Consider the exponentially decaying function 


_ f0 t<0 
f® _ 2 t> ) > (8.122) 


where a, is a real, positive number, as shown in Figure 8.38. This function is 
absolutely integrable, and its Fourier transform is easy to calculate: 


F(a) dt e'e7 


| 


yeh 
1 1 

= 8.123 

a Ce). 7 


Notice that this transform has both a real and an imaginary part. This is because f(t) 
has neither odd nor even symmetry. 


f(t) 


7 


Figure 8.38 Exponential Decay 
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@-plane 


Figure 8.39 Inversion of Exponential Transform in the w-Piane 
Now consider the inverse transform back to f(#): 


pe, ela 1 
f= = [awe eo 5 (sx)| 


] 00 eit 
= —— dw —, (8.124) 
271 J. @ — IM 


In Equation 8.124, w is a pure real variable. The integral is easily calculated by 
extending the integral into the complex w-plane, and then closing the contour. The 
complex integral is 


eat 


1 
fO = ami |,42 Boia (8.125) 


where the Fourier contour F is along the real axis, as shown in Figure 8.39. A single, 
first-order pole sits at @ = iag. 

If t > 0, we can get away with closing the contour using a semicircle of infinite 
radius even though the denominator of the integrand falls off only as fast as 1/w. To 
justify this, we must show that the contribution along the path N, shown in Figure 8.40, 
vanishes as its radius goes to infinity. On M, w = Rye!” and dw = iR,e'"d6, so 


dw = ii dé 


~~ @— 1H Rom fo Roe®® — ia, 


/ eit 7 iR,e'e! ‘of COS 9 Rot sin 6 
m a 
Nn 


Figure 8.40 Upper Half-Plane Semicircle for Fourier Inversion 
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iR 8 piRot cos 8 e Rot sin 6 
, o 
= lin). ae i ae ain 
Roo Jo Roe’ 
iif . . 
_ im, d0 ietRot cos 8a — Rot sin a (8.126) 
‘O 


Now consider the magnitude of the contribution from 1: 


eiat 
dw : 
al QM —~ 1B 


= lim 


Roo 


lim 


Roe 


7 
i de ie*Rot cos 8 Rot sin 8 
0 


7 


= lim | d@e ®t sin? 


Ru, 0 


a {2 ; 
<= jim 2 [ d@ e~ Rot sind (8.127) 


The second step in this sequence comes from the fact that |ie'*-'"®| = 1, while the 
last step is true because the function is symmetric about @ = 7/2. 
There is an important result, called Jordan’s inequality, which can be used at this 
point. The inequality states that, for t > 0, 
a/2 


= : 7 
d@e Rot sin@ — 


= : 8.128 
; Rt ( ) 


This is proved quite easily by noticing that sin @ = 20/a for0 = 6 = 7/2. Using 
this result in Equation 8.127 gives 


lim 
Ry, 


dw 
nN w@~— 


= 0. (8.129) 


This shows that for ¢ > 0, the contribution from vanishes as the radius grows 
infinite, as long as F(w) falls off as 1/|c| or faster. 

For t < 0, we cannot close the contour in the upper plane as before because the 
integrand diverges for large, positive imaginary numbers. Instead, we must change 
the contour to close in the lower half plane. By arguments similar to the ones given 
above, you can show there is no contribution added when this contour is closed. That 
is, 


it 
[2 fC =0 1<0, (8.130) 
U 


@ ~ 1 


where the contour U is a semicircle of infinite radius in the lower half w-plane. 
Consequently, the integration of Equation 8.125 can be performed by closure 


fone a $485 
— ia,’ 


(8.131) 
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@-plane Cfort>0O 


Figure 8.41 Closure of the Fourier Inversion Integration 


where the contour C is closed in the upper half w-plane for ¢ > 0, and in the lower 
half @-plane for t < 0, as shown in Figure 8.41. 

For each contour, the integral can be determined by adding up the residues of 
the enclosed poles. The t < 0 contour has no enclosed singularities, and the t > 0 
contour has just one, with a residue of e~ %'. Thus our final result is 


fO= eee sae (8.132) 


which is our original function. 

Now, consider what happens as a, approaches zero. This makes the f(t) look 
more and more like a step function, as shown in Figure 8.42. For any a, bigger than 
zero, we can obtain a valid Fourier transform, but if a, = 0, we know we cannot take 
the Fourier transform, because the integral 


1 f* 
F(@) = ae : dte™ (8.133) 
0 


does not exist. 


f(t) 


Figure 8.42 The Exponential Function in the Limit of a, — 0 
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Figure 8.43 Modified Fourier Contour to Handle Exponential Function in the Limit of 
a, - 0 


By looking at what happens to the inversion contour for this function in the a, — 0 
limit, we can gain a valuable alternative viewpoint. As long as a, is greater than zero, 
no matter how small, the pole of the integrand of Equation 8.125 is in the upper half 
w-plane, and the F contour can remain on the real w-axis. But when a, = 0, the pole 
sits right on top of the ¥-contour and the integral cannot be performed. 

Notice, however, that if the F-contour is deformed to F’ by dodging the pole, as 
shown in Figure 8.43, the inversion integral 


1 eit 
= ~— — wl 
f= 5 im a (8.134) 
reproduces the unit step function 
_ f0 t<0 
fO = tS 1>0° (8.135) 


This is no coincidence, and it will lead us to a more general integral transform called 
the Laplace transform. 


8.6.9 Damped Sinusoid 


AS a last example, consider the damped sinusoid 


t<0 


_ f0 
f@= foe sin(a@ t) t>0’ (8.136) 


where a, and w, are both positive numbers. Figure 8.44 shows this function. The 
Fourier transform is easy to calculate: 


1 oo ; imet _ ,—iwWet 
== [ dt e '%e7 as i ) 
V 27 JO 2i 
- w,/~ 27 


(w — w,)(@ — aw)’ 


F() 


(8.137) 
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Figure 8.44 The Damped Sinusoid 


where @, = @ + ia, and w@. = —@, + ia,. Notice F(w) is a complex function 
because f(t) has neither odd nor even symmetry. 
The inversion transform of Equation 8.137 is 


1 = : —@,/V 20 
a do eb —___—_+—__., .138 
i) V20 iz ee (@ — @;)(@ — w2) Ga) 


As with the exponential decay function of the previous section, the best way to 
evaluate this integral is to extend it into the complex plane, and close the complex 
contour. The complex version of the inversion integral is 


1 —@,// 27 
aa. | ggg Sal Ve 8.139 
Flt) =p wc oD (8.139) 


where F is the Fourier contour along the real w-axis, as shown in Figure 8.45. Also, 
like the exponential decay example, we are forced to close the contour in the upper 
half-plane for t > 0 and the lower half-plane for t < 0. This is due to the influence 
of the e*“ term in the numerator. As you will prove in one of the exercises at the end 
of the chapter, Equation 8.139 returns the original function f(t), 


_f90 t<0O 
JAS eee sin(w,f) t>0’ 


(8.140) 


as long as a, > 0. 


Figure 8.45 Inversion of the Damped Sinusoid Transform in the w-Plane 
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f() 


Figure 8.46 The Undamped Sinusoid 


The Fourier transform and inversion for the damped sinusoid work correctly for 
all values of a, > 0, no matter how close to zero a, gets. Exactly ata, = 0, however, 
we know we cannot have a valid Fourier transform, because now f(t) becomes 


t<0O 


_ fo 
fO= ee 1>0 erly 


as shown in Figure 8.46. The sinusoid is no longer damped, and the Fourier integral 
does not exist. This problem is also evident from looking at the inversion contour. 
With a, = 0, the Fourier transform in Equation 8.137 has become 


w/ J 2m 


Fe) ~~ = wo + 0)" 


(8.142) 


and the two poles in Figure 8.44 now lie on the Fourier contour. 
Notice, however, Equation 8.142 still can be used to generate the undamped 
sinusoid 


f(t) = —— | dw eiat —@_/¥ 277 . 
V 2a JF! (@ — @o)(@ + Wp) 


if the Fourier contour is modified to ¥’, as shown in Figure 8.47. In fact, even an 
exponentially growing sinusoid, 


(8.143) 


Figure 8.47 Modified Contour to Handle an Undamped Sinusoid 
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Figure 8.48 Exponentially Growing Sinusoid 


Figure 8.49 Modified Contour for an Exponentially Growing Sinusoid 


0 t<0 
IO = 1 ea sin(@ ft) t>0’ Ge) 
such as the one shown in Figure 8.48, can be recovered from this inversion, if we let 
the ¥'contour dip below the poles of F(w), as shown in Figure 8.49. 
These deformed contours seem to extend the Fourier inversion into realms where 
you would not expect it to work. In the next chapter we formally introduce these 
extensions as the basis for the Laplace transform. 


8.7 THE SAMPLING THEOREM 


One of the more important results of Fourier transform theory is the Sampling The- 
orem. This theorem determines at what rate a signal must be sampled so enough 
information is available to properly reproduce it. This is important, because sampling 
too infrequently will cause the reproduced signal quality to degrade, while requiring 
a sampling rate which is too high is inefficient. 

Consider sampling an audio signal for recording onto a compact disc. Let the 
original microphone signal a(t) look as shown in Figure 8.50. Furthermore, let this 
analog signal have a Fourier transform A(w), as shown in Figure 8.51. This spectrum 
vanishes above and below some cutoff frequency Wm2,, which is perhaps imposed by 
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a(t) 


Figure 8.50 The Original Analog Audio Signal 


the mechanics of the microphone. For the purposes of this discussion, the transform 
will be assumed to be pure real. This assumption is not necessary for the validity of 
the Sampling Theorem, but is convenient for our drawings. 

A useful way to model the sampling process, and the signal recorded on the CD, 
is to let the sampled signal, which we will call a,(1), be equal to the original signal 
times a periodic set of 5-functions 


a(t) = a(t) > 8(t — nT). (8.145) 


n=—0o 


The time between the 5-functions is the sampling interval. Figure 8.52 shows this 
representation. 

Because a,(t) is the product of two functions, the spectrum of a,(t), which we will 
call A,(a), is the convolution of A(w) with the transform of the train of 5-functions: 


ok V2 = = 
SO ae ga oe a 2am/T,). (8.146) 


When the sampling rate is very high, that is when 7, < 217/Wmax, As(o) has the form 
shown in Figure 8.53. The spectrum of a,(t) is not the same as the spectrum of a(t), 
because it has many more high frequency components. If it were reproduced through 


A(®) 


@ 


— Omax max 
Figure 8.51 Fourier Transform of the Original Signal 
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Figure 8.52 The Sampled Signal 


an amplifier and speaker system, it would not sound like a(t). However, sitting in the 
spectrum of a,(t), from —Wmax tO +@max, is the exact spectrum of a(t). Thus, if we 
send this signal through a low-pass filter before amplification, the result should be a 
sound that reproduces a(t). 

A simple low-pass filter can be made from the capacitor and resistor circuit shown 
in Figure 8.54. The input voltage v;,(t) and output voltage v,,;(t) are related by the 
differential equation 


vin(t) = ao Tan) + Voul(t). (8.147) 


Applying a Fourier transform to both sides of this equation gives an algebraic equation 
relating the Fourier transforms of the input and output waveforms: 


Vin(@) = (iWRC + 1)V a4) (8.148) 
or 
Vou (@)/Vin(@) = EME TE, (8.149) 
A,(@) 


: 4n/T, 


-4n/T, -2n/T, 2n/T, 


2 Wmax 
Figure 8.53 Spectrum of Sampled Signal 
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R 


+ el | + 
Vin(t) Cc T Vourl®) 


Figure 8.54 Simple Low-Pass Filter 


The quantity V,,,,(w)/V,,(@) is called the transfer function for the filter. The 
magnitude of this function is shown in Figure 8.55. For frequencies much less than 
w = 1/RC, the output voltage and input voltage are essentially identical. For fre- 
quencies much above w = 1/RC, the output is zero. This transition frequency is 
called the cutoff frequency of the filter, w,,. For a simple RC filter, such as this 
one, the transition at the cutoff is not very sharp. Better filters can be made with 
a more complicated combination of components, or with more sophisticated digital 
processing components. The ideal low-pass filter has the transfer function shown in 
Figure 8.56. 

If we pass the sampled signal through an ideal low-pass filter with @,5 = @max, 
the output of the filter will have a Fourier transform A ;(w) that is exactly equal to 
A(@), and consequently the signal coming out of the filter a(t) will sound exactly 
like the original a(t) signal from the microphone. The filter has removed the spurious 
high frequency components introduced by the sampling. This process is shown in 


1 
1+i@RC | 


-V/RC | VRC 


Figure 8.55 Transfer Function for a Simple RC Low-Pass Filter 


Vou(O)/Vin(@) 


Figure 8.56 Ideal Low-Pass Filter 
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A;-(@) 
Low Pass 
fa] 
— Dax @nax 
a,(t) Low Pass “0 
——* | Filter —_ 


Figure 8.57 Frequency and Time Domain Pictures of the Filtering Process 


A,;(@) 


Figure 8.57. The top picture in Figure 8.57 shows the filtering process in the frequency 
domain, while the bottom picture shows the same process in the time domain. 

In the previous arguments, we made the important assumption that the original 
signal was sampled at a high rate. The spectrum for A,(w) is composed of a series 
of A(w) spectra replicated every » = 2n7/T,. As long as 7/T, > max, these 
spectrum do not overlap, and our previous analysis works fine. But when T, grows so 
large that 17/7, < @max, the individual spectra overlap each other and A,(@) looks as 
shown in Figure 8.58. Now, the output of the low-pass filter with @,4 = @max gives 
the spectrum shown in Figure 8.59. In this case the signal coming out of the filter 
would no longer sound like the signal from the microphone. 

Thus we see, in order to preserve the information in a signal, it must be sampled 
such that 77/T, > max. This means the sampling frequency 27/7, must be at least 
twice the highest frequency present in the sampled signal. The minimum sampling 
frequency is often called the Nyquist sampling rate. 

Keep in mind that we have drastically simplified the process of CD sampling in 
our previous argument. Some practical issues that introduce complications should 
be pointed out. First, in real CD sampling, the recorded audio signal is digitized 


A;(@) 


-4n/T, -2n/T, 


2n/T, 4n/T, 


2 Wax 
Figure 8.58 Spectrum of an Undersampled Signal 


EXERCISES 295 


Af(@) 


— Dmax Omax 
Figure 8.59 Spectrum of the Filtered, Undersampled Signal 


and can take on only discrete values. This requires the use of an analog-to-digital 
converter (ADC). The recreation of the original signal then requires the inverse step 
of a digital-to-analog conversion (DAC). Second, for an audio signal, the practical 
limit for w,,,, is about 20 kHz, the frequency limit of the human ear. The music 
industry has settled on a 44 Khz sampling standard, just above twice the maximum 
frequency we can hear. However, the microphone signal and its electronic handling 
before sampling may very well have frequency components well above 22 kHz. To 
prevent the overlap in the convolved spectra for A,(w) that would resuit by sampling 
such a signal, a low-pass filter with a cutoff frequency near 22 kHz is inserted just 
before the sampling electronics. Finally, there is no such thing as an ideal filter with 
a transfer characteristic like that shown in Figure 8.56; there will always be some 
“rolloff.” The better the filter, the better the sound reproduction. 


EXERCISES FOR CHAPTER 8 


1. What is the Fourier Transform of the function 


e 7% + ge 2at t>0 


f= {6 1<0° 


where a is real and positive? 


2. Find the Fourier transform of the function 


0 otherwise 


fa) = oe —an/2<t< 1/2 


Ts there a way to evaluate this without actually calculating the transform integral? 


3. Find the Fourier transform of the signal shown below, 


FOURIER TRANSFORMS 


f(t) 


-a 


(a) by directly calculating the transform integral. 


(b) by differentiating the signal and writing the transform in terms of transforms 
that were already derived in this chapter. 


. In this chapter, we determined the Fourier transform of the Dirac 5-function. 
Find the Fourier transform of the derivative of the Dirac 5-function, 


d&(t) 


F(t) bars he 


Compare these two transforms. 


. Find the Fourier transform of the function 


Bb O<t<1 
f= ‘0 otherwise 


(a) by directly calculating the transform integral using integration by parts. 


(b) by differentiating the signal three times and writing the transform in terms 
of the sum of two simpler transforms. 


. The Fourier transform of f(t) is 


—i 
E(w) = w — i2aw — a? 


Determine the function f(t) and make a plot of it for positive and negative values 
of t. In the limit of a — 0, what happens to F(w) and f(t)? 


. Obtain the Fourier transform of the function 


e~ %—7) sin [w(t — 7] t>T 


r= {6 peat 


Show that the inversion of this Fourier transform requires a slight modification 
to the standard ¢ > 0, t < O closure semicircles discussed in this chapter. 
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f(t) 


8. Consider the Fourier inversion 


_ 00 eikk 
fey = fake 


where x and k are pure real variables. 


(a) For x > 0, determine how this integral can be closed in the complex k-plane 
and evaluate f(x). 


(b) For x < 0, determine how this integral can be closed in the complex k-plane 
and evaluate f(x). 


9. Prove that the inversion of Equation 8.139 gives Equation 8.140. 


10. Given the functions f(t) and g(t) shown below, 


t f(t) 
g(t) ; 


(a) Make a labeled sketch of f(t) © g(t) . 
(b) Make a labeled sketch of g(t) © g(t) . 
(c) Make a labeled sketch of the autocorrelation of g(t). 
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11. Consider the two functions 


_ f 1/2 -1<t<+1 
f= to otherwise 


g(t)=t et. 


(a) Plot f(2) and g(t) for all ¢. 
(b) Find an expression for A(t) where h(t) = f(t) © g(2). 
(c) Plot A(t). 


12. Consider the two functions: 


ms ee -1<t<+1 
TO e otherwise 


(t) = 1 Oo<t<+1 
8 0 otherwise 

Make a labeled sketch of f(t) convolved with g(t). 
13. Let f(t) be a square pulse 


te 1 -f,/2 ts +I15/2 
TO = fe otherwise . 


and g(t) be made up of a delayed version of f(t) plus a sinusoidal function of 
time: 


g(t) = f(t — 107.) + sin (wot), 
where w, = 207/7,. Make a labeled sketch of the cross-correlation of f(t) with 


g(t). 
14. Show that the convolution integral is symmetric. That is, prove 


[ ars@at—9= [ aramre-o. 
15. Show that if both g(t) and f(t) are zero for t < 0, then the convolution of the 
two functions g(t) ® f(t) is also zero for t < 0. 


16. Find the Fourier transform of the cross-correlation of f(t) with g(t) in terms of 
the Fourier transforms of the individual transforms F(w) and G(w). 


17. Use the fact that the Fourier transform of a Gaussian is a Gaussian, 


a nfs 22 1 eae) 2 
sg i oe eG) 


wT V 20 
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to evaluate the integral 


—? 


On a single graph plot A(t) and e 


18. The function f(t) is a periodic train of Gaussian pulses with period 7, as described 
below. 


a <. —a(t—nT)? 
HOP ae (nT) 


n=—© 


1c(3) 


-2T -T T 2T 


This function can be looked at as the convolution of a single Gaussian pulse 
with a train of Dirac 6-functions. Using this, determine the Fourier transform of 


f(t). 
19. Find the Fourier transform for the periodic function f(t) shown below. 


f() 


i) 


Look at f(t) as the convolution of two functions. 
20. Find and plot the Fourier transform for the periodic function f(t) where 


e % sint O<t<67 
fHm=¢40 67 <t< 607. 
f(t + 607) all t 


21. 


22. 
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f(t) 


60 t 
6X 


To evaluate real definite integrals, we have often used the technique of closure 
in the complex plane. We used a similar process to invert several of the complex 
Fourier transforms developed in this chapter. To be able to use closure, the 
functions being integrated must meet certain conditions. Show that the conditions 
for closure are different for a real, definite integral than they are for a Fourier 
inversion. 

For this problem, consider the function: 


2 
w 
PO) aaa 
(a) Show that, if the integral 
| dw F(w) 


is extended into the complex w-plane, closure is not possible. 


(b) Show that the inverse transform 


f= [ . dx e' F(w) 


can be closed in the complex w-plane. 


Ultrafast lasers have pulse widths that are on the order of 50 femptoseconds (a 
femptosecond is 10~' seconds). This pulse width is difficult to measure since 
the signal is so short that it cannot be displayed directly on an oscilloscope. It is 
often measured using an autocorrelation technique with an apparatus similar to 
the one shown below: 
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Output Signal 
Proportional to 
[A(t) + B(t)/? 


A(t) 


A(t) + B(t) 


50 fs 


' 


100% 
Mirror 


The original pulse is incident from the left on a 45°, 50% mirror. A 50% mirror 
reflects half of the incident signal and transmits the other half. The 100% mirror 
is at a fixed position and reflects all of the incident light. The corner reflector 
reflects all of the light back along the incident direction, regardless of the angle 
of incidence. This reflector is mounted on a stage that oscillates back and forth at 
a frequency of 10Hz. The split signal is then recombined at a detector, which is 
usually a photomultiplier tube or a fast LED detector. The detector generates an 
electrical signal which is proportional to the total number of photons in several 
pulses of the combined [A(t) + B(t)]* signal—i.e., it integrates the intensity of 
the signal over an interval long compared to the laser pulse width, but short 
compared to the period of the oscillating stage. Notice how the motion of the 
oscillating mirror causes the relative position of B(t) compared to A(z) to change. 
The output of the detector is applied to the vertical deflection of an oscilloscope, 
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while the signal that drives the position of the corner reflector is applied to the 

horizontal deflection. 

(a) How far does light travel in 50 fs? For red light, how many oscillations of 
the electric field occur in 50 fs? 

(b) Assume a train of 50 fs pulses is incident on this device, and one pulse is 
separated from the next by 0.01 js. Assuming the motion of the oscillating 
reflector and the display of the oscilloscope are adjusted so the B(t) signal 
moves from 200 fs before A(t) to 200 fs after A(t), what does the trace on 
the oscilloscope look like? To keep things simple, assume the pulse shape is 
square. 

(c) How should the 100% mirror and the corer reflector be positioned with 
respect to the 50% mirror so that the operation in part (b) above is obtained? 

(d) How is the width of the signal on the oscilloscope related to the pulse width 
of the incident signal? Treat the motion of the reflector as linear in time in 
the region where the two signals overlap. 


9 


LAPLACE TRANSFORMS 


ni this chapter, the Laplace transform is developed as a modification of the Fourier 
ransform. The Laplace transform extends the realm of functions which can be trans- 
‘ormed to include those which do not necessarily decay to zero as the independent 
variable grows to infinity. This is accomplished by introducing an exponentially de- 
‘aying term in the transform operation. The inverse Laplace operation can be viewed 
is a modification of the inverse Fourier transform, with the Fourier contour moved 
off the real w-axis into the complex w-plane. 


4.1 LIMITS OF THE FOURIER TRANSFORM 


We can motivate the development of the Laplace transform by examining a func- 
ion which clearly has no Fourier transform. Consider the ramp function shown in 
“igure 9.1 and described by 


f= 40 ae 9.1) 


[his function does not go to zero as ¢ goes to infinity and thus does not have a Fourier 
ransform. 

However, from the previous chapter, we know that a derivative operation in the 
ime domain is equivalent to multiplication by the factor of iw in the transformed 
Tequency domain. In other words, 


{xo 


ai \ = IoF(w). (9.2) 
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Figure 9.1 The Ramp Function 


Notice that the ramp function given by Equation 9.1 is related, by two derivatives, to 
the delta function, 


d? f(t) 
at? 


= 6(), (9.3) 


as shown in Figure 9.2. In the previous chapter, we showed that the 6-function has 
the simple Fourier transform: 


(9.4) 


F {8()} = a 


-_—> F(@) 


<—- = i@F(o) 


, d°f(t)/dt?= 5(t) 


1 
ae ie 7 EO) 


Figure 9.2. Relationship Between the Ramp and 8(t) 
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So if we suspend our rigor for a moment, we can naively suppose Equation 9.2 lets 
us write the transform of the step function of Figure 9.2 as 
-i 
wr/ 27 


Applying Equation 9.2 one more time, with reckless abandon, gives the Fourier 
transform of the ramp function as 


(9.5) 


—] 
w/a 
Of course, we know neither the step function nor the ramp have legitimate Fourier 


transforms. This is confirmed by examining the inverse Fourier integral of the trans- 
form functions given in Equations 9.5 and 9.6. The inversion for Equation 9.5 is 


(9.6) 


1 i . ‘oe 
a dw e ; (9.7) 
V2 J—x wry 27 
and the inversion for Equation 9.6 is 
1 = —1 
dw e™ (9.8) 


Neither of these integrations can be performed because of the divergence of the 
integrands at w = O. If you look at the representations of these integrals in the 
complex w-plane, you will notice that a pole sits right on the Fourier contour. 

This is the problem we examined briefly at the end of the previous chapter. By 
modifying the contour to dip below the offending pole, as shown in Figure 9.3, 
we actually can recover the original function from the transform function given 
in Equation 9.6. We will show this in the next section. Our sloppy derivation of the 
ramp function’s transform must contain some grain of truth. By examining the inverse 
process a little more closely, we can arrive at the basis for the Laplace transform. 
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Figure 9.3. A Modified Fourier Contour for the Ramp Function 
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9.2 THE MODIFIED FOURIER TRANSFORM 


In this section, we show how the standard Fourier transform can be modified by 
moving the inversion contour off the real axis. With this approach, we can transform 
functions that would not normally possess valid Fourier transforms. We start with 
some of the ideas introduced at the end of the previous chapter. 


9.2.1 The Modified Transform Pair 
In the previous section, we indicated that the ramp function of Equation 9.1 could be 
recovered from the transform given in Equation 9.6 using an inversion integral 


— piwt 
er 


1 
Soha Pig 
f(t) pro oo 7a 


on the modified contour F’ shown in Figure 9.3. This contour lies along the real w- 
axis, except at the origin where it dips below the pole. The discussion of the previous 
chapter shows that we can use closure in the upper half w-plane for t > 0, and in the 
lower half for t < 0. Then, using the residue theorem, it is easy to show that 


(9.9) 


Foyt ee (9.10) 


The ramp is recovered by integrating the function given in Equation 9.6 along the 
modified Fourier contour F’. 

This, however, is not the only path that will recover the ramp function. Any path 
lying in the lower half @-plane will work. Consider the path shown in Figure 9.4 that 
goes along the straight line w; = —f, where w; is the imaginary part of w: 

eae eivt 
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(9.11) 
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Figure 9.4 Another Modified Fourier Contour for the Ramp Function 
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Figure 9.5 Inversion for the Ramp Function 


This integration is easily performed by defining a transformation of variables with 
wo’ = w + if. With this transformation, the modified Fourier contour F” that was 


along w; = —B becomes a standard Fourier contour along the real w/ axis: 
ePt —eiw't 
j= — | dwo' ——. 12 
(0) a I wl rape (9.12) 


There is a second-order pole at w’ = if, as shown in Figure 9.5. Now standard Fourier 
closure techniques can be applied to recover the same ramp function as before. Notice 
that, with the pole at w’ = iB, the e* factor present in the numerator of Equation 9.12 
is exactly canceled out by the residue. From these arguments it can be seen that the 
modified Fourier contour F' of Equation 9.9 can be placed anywhere in the lower 
half w-plane, as shown in Figure 9.6. 

The fact that the ramp function can be generated using Equation 9.6 in this inversion 
process raises the question as to whether this transform can be obtained directly from 
the ramp function itself, by extending the regular Fourier transform process into the 
complex w-plane. If we allow w to take on complex values, the Fourier transform 
becomes 


1 . , 
F(@) = — | dte' f(t). (9.13) 
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Figure 9.6 Convergence Region for the Ramp Transform 
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If @ = w, + iw;, we can rewrite Equation 9.13 as 
1 ii ; 
F(w) = = ) dt e'* [e*! F(t)} . (9.14) 
T J-n% 


Notice the integral in Equation 9.14 looks just like a regular Fourier transform of the 
function f(t)e°". 

Can this process generate Equation 9.6 from the ramp function? Inserting the ramp 
function into Equation 9.14 gives 


1 2 . 
F(@) = —— | dt e**t. (9.15) 
2a JO 


The fact that the ramp function is zero for t < 0 makes the limits of integration range 
from 0 to +0. Integrating by parts, taking u = t and dv = e7!#'dt, gives 


e iat 


foo) 1 6 
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(9.16) 


1 —te iw 
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Both terms of this expression will converge only if w; < 0. In that case, Equation 9.16 
evaluates to 


J 271w? 


Thus, at least for the ramp function, it appears we can define a modified Fourier 
transform pair by 


F(w) = «; <0. (9.17) 


F(@) = ee dt e ' f(t) (9.18) 


Jf 21 J-« 


f(t)= —== | dwe'F(w), (9.19) 
F' 


with the requirement that the imaginary part of w be negative. The convergence region 
for this modified Fourier transform is identified in Figure 9.6. 


9.2.2. Modified Fourier Transforms of a Growing Exponential 


The modified Fourier transform pair given by Equations 9.18 and 9.19 has been 
shown to work for the ramp function. Will it work for other functions that do not 
possess regular Fourier transforms? Consider the exponentially growing function 


0 t<0O 


{oO = ee +> 0” (9.20) 
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Figure 9.7. Convergence Region for Modified Fourier Transform of the Exponential Function 


where a is a positive, real number. When this function is inserted into the modified 
transform of Equation 9.18, you can easily show, as long as w; < —a, that 


F(w) = (9.21) 


—i 
V/2m(w + ia) 
Figure 9.7 shows this convergence region. Notice, as before, the convergence region 
is required to be below the pole of F(w). 

Let’s investigate the inverse operation, which must return the original function of 
Equation 9.20. Combining Equation 9.21 with the inversion integral of Equation 9.19 
gives 


1 ei 
f@) aa [40 Sa (9.22) 


The modified Fourier contour ¥’ for this integration is shown in Figure 9.7. 

The same stunt we used for the ramp function can be applied here. A simple 
variable transformation, w' = w + iB, converts F’ to a contour on the real axis of 
the w’-plane: 


iw't 


Br © 
f®=— / do! —~__ (9.23) 
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Again, the integral in Equation 9.23 can be evaluated by closing the contour. The 
complex exponential forces us to close in the lower half plane for t < O and in the 
upper half plane for t > 0. There is a single pole at w' = i(B — a) with residue 
e“e—® Thus, the residue theorem tells us 


ft) = — oe (9.24) 


which is indeed the original function. 
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9.2.3 Closure for Contours off the Real Axis 


In the previous two examples, we made a variable change from @ to w’, so the 
F' contour was changed into a contour along the real axis. The only reason we 
did this was so we could use the standard arguments developed in Chapters 6 and 
8 to show the contribution of the closing contour was zero. Actually, this step is 
not necessary. We can close directly in the w-plane, if we analyze the contribution 
from the closing contour a little more carefully. This is more work than the variable 
transformation approach, but once this general argument is understood, it simplifies 
the whole process. 

To see how this works, let’s return to the specific integration of Equation 9.22. For 
t < 0, consider the closed contour shown in Figure 9.8. In the limit of R, — ©, the 
straight line segment from A to B is the modified Fourier contour ¥’ of Equation 9.22. 
This contour can be closed, as shown, with the circular segment in the lower w-plane. 
Using the standard closure arguments, it is easy to see the contribution from this 
circular segment as R, — © is zero. Because there are no poles of the integrand of 
Equation 9.22 below F' we obtain 


fO=0 <0. (9.25) 


For t > 0 consider the closed, solid contour shown in Figure 9.9. In the limit of 
R, — ©, the straight line-segment AB again is ¥’, the modified Fourier contour. 
The contour is now closed by a circular segment BCDA that is mostly in the upper 
half w-plane, but has sections, BC and DA, that extend into the lower half w-plane. 
The contribution from this BCDA circular segment is also zero, but because of the 
sections BC and DA this fact is more difficult to show. 
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Figure 9.8 Closure for the Modified Fourier Transform for t < 0 
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Figure 9.9 Closure for the Modified Fourier Transform for t > 0 


To show that this contribution is zero, we begin by identifying the points A and B. 
The point A is located at @ = —r, ~ iB, and B is located at w = r, — iB, as shown in 
Figure 9.9. Notice that as R, — ©, r, — R,, while B remains fixed. The integration 
along the total circular segment can be integrated by parts: 


] eiat —egiat 


——— W@W ———4 
2a BCDA ~~ @ + ia 27t(w + ia) 


eit 
——_———.. 9.26 
B he de 2mt(w + ia) 0-26) 


Evaluating the first term on the RHS gives 


— gist 


27t(w + ia) 


(9.27) 
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As R, and r, — ~, the RHS of Equation 9.27 goes to zero. The integration along the 
circular segment can therefore be broken up into three parts: 
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The second integral, on segment CD, goes to zero as R, — © by the standard argu- 
nents used for the Fourier transform. This leaves only the integrals along segments 
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Figure 9.10 Closure of a Modified Fourier Contour 
BC and DA. Consider the BC segment. If w = w, + iw;, everywhere along this 
segment w, = r, and w; = —B. This implies 
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(9.29) 


As R, — ©, the integrand is dropping off as 1/R while the path length is limited 
to a length of 8B. Consequently, as R, —+ ©, the contribution from the BC section, 
for any fixed value of t, goes to zero. The same arguments can be made for the DA 
section. Therefore, for t > 0, the integration along the circular segment BCDA goes 
to zero and we can close the contour. The pole at # = —ia is inside the contour of 
Figure 9.9, and the residue theorem gives 


fd=e" ¢>0. (9.30) 


To summarize, we can close the modified Fourier contour for the growing expo- 
nential function with the closed contours shown in Figure 9.10. This approach gives 
the same result as was obtained by first transforming to the w’-plane. In the rest of 
this chapter, we will take this kind of reasoning for granted as we close different 
contours. 


9.2.4 General Properties of the Modified Fourier Transform 


The ramp and exponential function examples demonstrate the general conditions 
which allow the modified Fourier transform to exist. First, the function to be trans- 
formed must be exponentially bounded. The meaning of that expression is evident 
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from looking at Equation 9.14. It must be possible to find an w; such that e®’ f(t) goes 
to zero as f goes to infinity. If this is the case, the function e’ f(t) will have a regular 
Fourier transform. Second, the function we want to transform must be zero fort < 0. 
Actually, it is quite easy to develop the modified Fourier transform arguments so that 
f(t) must be zero for all t < f,, where f, is any real number. You can explore this 
idea further in one of the exercises at the end of this chapter. 

The modified Fourier transform of Equation 9.18 will always exist for values of 
qw with imaginary parts less than the imaginary parts of all the poles of F(w). This 
implies the F’ inversion contour of Equation 9.19 must also lie below all these poles. 
The inversion can be accomplished by closing above the modified Fourier contour 
for ¢ > 0 and below it for t < 0. 


9.3. THE LAPLACE TRANSFORM 


The standard Laplace transform is really just the modified Fourier transform we have 
just discussed with a variable change and a rearrangement of the constants. The 
variable change involves the simple complex substitution 


s= iw. (9.31) 


Because the inversion equation is frequently evaluated using the residue theorem, 
the constants are juggled around so the leading constant of the inversion is 1 /(2 772). 
With these changes, the Laplace transform pair follows directly from Equations 9.18 
and 9.19: 


F(s) = i dt e~* f(t) (9.32) 
0 
f(t) = <a if ds e* F(s). (9.33) 
271 L 


It is implicitly assumed by the form of Equation 9.32 that f(t) = 0 for allt < 0. 
These equations are sometimes written with a shorthand operator notation as: 


F(s) = L{f(} (9.34) 
ft) = LONE}. (9.35) 


The picture of the Laplace inversion in the complex s-plane looks very much 
like the pictures of the modified Fourier transform, except for a rotation of 7/2. 
Figure 9.11 shows a typical convergence region and inversion contours. The Laplace 
transform converges in the region to the right of all the poles of F(s). The Laplace 
inversion contour £, which is called the Laplace or Bromwich contour, must lie 
in this convergence region and extends from s = B —i»tos = B +i. The 
inversion contour is almost always evaluated by closing the complex integral in 
Equation 9.33. Using similar arguments given for the modified Fourier transform, the 
Laplace contour is closed to the right for t < 0 and to the left for t > 0. Because £ 
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Figure 9.11 Complex s-Plane for the Laplace Inversion 


lies to the right of all the poles, the inversion always generates an f(t) that is zero for 
t<0. 

It is the real part of s that allows the Laplace transform to converge for functions 
which do not have valid Fourier transforms. For an exponentially growing f(t), as 
long as the real part of s is larger than the exponential growth rate of f(s), the 
Laplace transform will converge. If f(¢) is not exponentially growing, but in fact 
exponentially decaying, all the poles of F(s) are to the left of the imaginary axis, and 
the convergence region will include the imaginary s-axis. In a case like this, the entire 
Laplace contour can be placed on the imaginary s-axis, and the Laplace transform is 
equivalent to a regular Fourier transform, with the exception of the fact that f(t) = 0 
for t < 0. This particular limitation will be removed when we consider the bilateral 
Laplace transform at the end of this chapter. Thus, we can really view the Fourier 
transform as a special case of the more general Laplace transform. 


9.4 LAPLACE TRANSFORM EXAMPLES 


In this section, the Laplace transform of several functions will be determined. With the 
exception of the 5-function example, these functions do not have Fourier transforms. 


9.4.1 The 6-Function 


The function 5(¢ — t,) has a Fourier transform, and if t, > 0, it is zero fort < 0. 
Therefore, we should expect its Laplace transform, except for an overall constant, 
to be the same as its Fourier transform with —is substituted for w. This is easily 
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confirmed by performing the transform: 


L£{6(t — t,)} = cE dt e *8(t — t,) 
0 
= e So, (9.36) 


There is no limit on the real part of s for performing this integration, and therefore 
the region of convergence covers the entire complex s-plane. Consequently, for the 
inversion of the Laplace transform of the 5-function, the Laplace contour can be 
placed anywhere. That is, any value of B is acceptable. Incidentally, the fact that we 
can let the Laplace contour lie on the imaginary s-axis is the reason the 5-function 
possesses a Fourier transform. 

The inverse transform that returns the 5-function is 


% l 
Ce SSS i dgete™, (9.37) 


with the Laplace contour £ located as shown in Figure 9.12. 

This is one of the few cases where the Laplace inversion cannot be done by closing 
the contour. Notice, however, because the convergence region is the entire complex 
s-plane, the Laplace contour can be placed on the imaginary s-axis. In that case, 
s = iw and ds = ida, so the RHS of Equation 9.37 becomes 


5 / dw ei!) = &(t — t,). (9.38) 
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Figure 9.12 Laplace Inversion of the 5-Function 
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Figure 9.13 The Heaviside Step Function 


This final result comes from the orthogonality relation derived in the previous chapter. 
Equation 9.37 is actually a more general form of that orthogonality relation: 


1 
—— | ds e'~%) = §(t — t,). (9.39) 
2a L 


9.4.2 The Heaviside Function 


The Heaviside step function does not possess a Fourier transform. It does, however, 
have a Laplace transform. Recall the definition of the Heaviside function is 


f(t) = i ae (9.40) 


which is shown in Figure 9.13. 
The Laplace transform of this function is 
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Figure 9.14 Inversion of the Heaviside Laplace Transform 
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This integration can be performed only when the real part of s, which we call s,, is 
positive. The Laplace inversion of this transform is 


xa fa g} 9.42 
ee a aan (9.42) 


The integrand has a first-order pole at s = 0, with a residue of 1. To be within the 
convergence region, the Laplace contour must fall to the right of this pole. It can be 
closed on the left for ¢ > O and to the right for t < 0, as shown in Figure 9.14. The 
evaluation in either case is straightforward and recovers the original step function 
given by Equation 9.40. 

This example shows why the operation described in Figure 8.43 of the previous 
chapter worked. By deforming the Fourier contour to ¥', as shown in that figure, 
the operation was really a modified Fourier transform, equivalent to the Laplace 
transform of this example. 


9.4.3 An Exponentially Growing Sinusoid 


Consider the exponentially growing sinusoid 


0 t<0 


fi) = ee iwi se (9.43) 


with a, > 0, as shown in Figure 9.15. Again, this function does not possess a 
traditional Fourier transform. Its Laplace transform, however, is given by 


F(s) - | dt ee! sin(wot). (9.44) 
0 


This integral exists only when s, > ap. In that region, 


Wo 
F(s) = Goce) 5, > Q, (9.45) 
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Figure 9.15 The Exponentially Growing Sinusoid 
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Figure 9.16 Inversion of the Laplace Transform for the Exponentially Growing Sinusoid 


where s; = @, — iw, and Ss, = a, + iw. The inversion contour must fall to the 
right of the two poles at s,; and s,, as shown in Figure 9.16. The integration for the 
inversion, 


1 st @o a 0 t<0 
Qari [as “ @-sE-s) {en sinfagt) 4 > 0° (9.46) 


generates the original exponentially growing sinusoid, as you will prove in an exercise 
at the end of this chapter. 

This example should be compared with the operation described in Figure 8.47 
of the previous chapter. We can now see why modifying the Fourier contour to dip 
below the poles gave the correct answer. Unwittingly, we were performing a Laplace 
transform! 


9.5 PROPERTIES OF THE LAPLACE TRANSFORM 
As we did with the properties of the Fourier transform, we will discuss the properties 


of the Laplace transform using the two functions f(t) and g(t). We assume both 
functions are zero for t < 0, and that they possess the Laplace transforms 


F(s) = in dte "f(a (9.47) 
0 
and 


G(s) = ie dt e * g(t). (9.48) 
0 
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These transform exist for the real part of s greater than some value, not necessarily the 
same for each function. Both these transforms can be inverted by integration along 
some Laplace contour 


1 st 
f(t) = Oni L ds e" F(S) (9.49) 
and 
1 St 
g(t) = Ini [ ds e* G(s). (9.50) 


We use two different contours, £, and L,, to allow for the possibility of two different 
convergence regions. 


9.5.1 Delay 


Consider the function f(t — t,), which is the function f(t) delayed by t,. The Laplace 
transform of this delayed function is 


Lf (t — t,)} = | dte "f(t — t). (9.51) 
0 


Make the substitution a = t — t,, so Equation 9.51 becomes 


L{f(t — to} = [ da e**"") f(a) 


ty 


= eh dae ** f(a) (9.52) 
0 
= e # F(s). 


The change in the lower limit in the second step is valid because f(a) = O fora < 0. 
A delay in time is equivalent to multiplying the Laplace transform by an exponential 
factor. 


95.2 Differentiation 
The Laplace transform of df(t)/dt is 


c{ dt \ : dt e a (9.53) 


The RHS of this expression can be integrated by parts with u = e * and dv = 
(df (t)/dt) dt to give 


WON oh es 
£ {art = f@me 


i. i dt e~"sfi(t). (9.54) 
0 0 
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If the real part of s is large enough to make f(t)e~* go to zero as t — ©, then 
df(t 
£ { oh = -f(0) + s Fs). (9.55) 


Notice the difference between this result and the derivative relationship for the Fourier 
transform. 


9.5.3 Convolution 


Now consider a function h(t), which is the convolution of f(t) with g(t): 
h(t) = dt f(1)g(t — 7). (9.56) 


The Laplace transform of h(t) can be written as 


| 7 dt e* h(t) 
0 
7 dt e*h(t) 


= / dt ee | dt f(r)g(t — 7). (9.57) 


HA(s) 


H 


In the second step, we extended the lower limit to —~ using the result of Exercise 8.15, 
which says if f(t) = 0 and g(t) = 0 for t < 0, then A(t) = 0 fort < 0. Now 
substitute the inverse Laplace transforms for f(7) and g(t — 7) to convert the RHS of 
Equation 9.57 into 


= ° ty 1 A 
/ ae f a laf ds Ee) lam J dst eG" 
a 


f(r) g(t~7) 


. (9.58) 


We have been careful to keep the inversion integrations independent by using two 
distinct variables of integration, s’ and s". Notice, the inversion for f(r) has been done 
on a Laplace contour £,, which lies within the convergence region for the transform 
of f(t). The integration variable s’ lies on £r. Similarly, the inversion for g(t — 7) has 
been done on a contour £,, which must lie within the convergence region for G(s). 
The integration variable s” lies on Ly. Figure 9.17 shows this situation, with the L, 
and £, contours placed as far to the left in their respective convergence regions as 
possible. Remember, the contours are not necessarily in the same location because 
the convergence regions of the two transforms may be different. The convergence 
region for H(s) falls to the right of all the poles of H(s). We will show that the poles of 
H(s) are just a combination of the poles of F(s) and G(s), so the convergence region 


PROPERTIES OF THE LAPLACE TRANSFORM 321 


Convergence region 
for transform of f(t) 


Convergence region 
for transform of g(t) 


Figure 9.17 Convergence Regions and Laplace Contours for f(t) and g(t) 


of H(s) is just the overlap of the convergence regions of F(s) and G(s). The variable 
s of Equation 9.58 lies anywhere in this convergence region of H(s). 

Because L£, is to the right of all the poles of F(s), and £, is to the right of all the 
poles of G(s), contour deformation allows us to move both £y and L, to the right 
without changing the values of their integrals. It will be very convenient if we move 
them to the right so that they become a common contour £ that passes through the 
point s. Remember s is the point where we want to evaluate H(s). Figure 9.18 shows 
this new configuration. In this situation, s, s', and s” will all have the same real part 
which we will call s,,. For the integrations in Equation 9.58, s,. is fixed, but the 
imaginary parts of s' and s”, which we call s/ and _s;’, vary independently. 

Using the common Laplace contour and moving the f dt operator to the right in 
Equation 9.58 gives 


-l 7 é a = Ww 
mi dt [ [ase ee] | [ast ras") | dt e& el) (9.59) 


Notice what happens to the last integral in this equation. Because s” and s have the 
same real part, that is s = S,. + is; and s” = s,. + is!, this integral is just the 
orthogonality relation: 


rl dt eli) = 2q(5!" — 53). (9.60) 


— 0 
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Figure 9.18 The Common Region of Convergence and Laplace Contour 


Applying this to expression 9.59 gives 
— | dr | ds'es™ F«)| / ds! e~*'"G(s")8(s" — sa| : (9.61) 
27 oo L —o 
where we have also made use of the fact that along £, ds” = ids/’, with sj! ranging 
from minus to plus infinity. The 5-function makes the integral over s" very easy and 


lets us write 


00 


H® == ] dt Bice ‘| dslet* ()). (9.62) 
27 5 L 
Interchange the remaining two integrals: 
H(s) = — | ds! ete | dr gel: (9.63) 
Qa L —o 


We can now pull a similar stunt to the one used on the last integral in Equation 9.59. 
Because s’ and s have the same real part, this looks like another instance of the 
orthogonality relation: 
/ dr eS) = 2778(5! — 5;). (9.64) 
Again making use of the fact that ds’ = ids} along the £ contour, we can write 
His) = / ds! [F(s)G(s)8(s! — 5] (9.65) 


= F(s)G(s). (9.66) 
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This is an elegant final result. The Laplace transform of the convolution of two 
functions is the product of the Laplace transforms of the functions 


ASO © sO} = FOG). (9.67) 


Because H(s) is the product of F(s) and G(s), the poles of H(s) are just a combination 
of the poles of F(s) and G(s). The convergence region for H(s) is therefore the 
intersection of the convergence regions of both F(s) and G(s), as we assumed earlier. 


9.5.4 Multiplication 
As a final situation, consider the product of two functions 


A(t) = fg. (9.68) 


The Laplace transform of h(t) is given by 


H(s) = | dt e~* f(t)g(t) (9.69) 
0 


a 1 1 
, dt e* E | ds'e! ne 
=e 7 


Again f(t) and g(t) have been assumed to be zero for t < 0, so that the dt integration 
of the Laplace transform can be extended to negative infinity. As with the previous 
case, the Laplace contours £, and £, are in their respective convergence regions and, 
initially, are placed as far to the left as possible, as shown in Figures 9.17 and 9.19. 

We begin the manipulation of Equation 9.69 by interchanging the order of inte- 
gration, to obtain: 


a / ds" e§"'G(s") 
21 £ 


& 


} me | 
LG ape i ds' F(s') | ds" G(s") ji are 2, (9.70) 
TW SL, L, —co 
On Ly let s’ = s), + is} and on £, let s” = s”, + is! where s/, and s/, are constants. 
If s = s, + is;, then in order for the dr integral to exist, we must have 


got sts =0. (9.71) 


If the condition is not met, the dt integral diverges and H(s) does not exist. If this 
condition is met, however, the dt integral becomes a 6-function, 


i dt efs't3"-D = i dt eltsitsi—si) (9.72) 


= 2778(s; + 5! — 5;), (9.73) 


and H(s) can be evaluated. 

The condition of Equation 9.71 constrains the convergence region of H(s). If F(s’) 
converges for s! > s, and G(s”) converges for s;’ > sz, then Equation 9.71 says H(s) 
can exist only for s, > sy + sg. This situation is shown in Figure 9.19. To see why 
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Figure 9.19 Convergence Regions for the Laplace Transform of a Product 


this is a reasonable result, imagine f(t) and g(+) are both exponential functions. Their 
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These integrations can be performed in either order. Doing the f ds/’ first gives 
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Equation 9.75 can be rewritten as 


= Sr» 


" 
ro 


ro 
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Using the condition that s 


This integration can now be converted back to one along the path L,: 
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Notice the result of Equation 9.77 is similar to the convolution integral discussed 
earlier, except now the integration is done along a complex path. The result of 
Equation 9.77 can be written in a shorthand notation as 


1 
LAF) gO} = ani [FO @z, G9). (9.78) 


If the order of the integration of Equation 9.74 is reversed the result becomes 


= = (GW) @r, Fi). (9.79) 
Wi 

We have obtained a result that is similar to the one for Fourier transforms. The 
Laplace transform of the product of two functions is the convolution of the separate 
transforms. In this case, however, the convolution must be integrated on one of the 
Laplace contours. 

In arriving at Equation 9.77 s, was determined by the locations of the £, and L, 
contours according to the condition s, = s/, +s”. As the Ly and L, contours are 
moved to the right, s, increases and consequently s also moves to the right, as shown 
in Figure 9.19. When using Equations 9.78 or 9.79 to evaluate H(s), however, it is 
convenient to take another point of view. In particular, when using Equation 9.77, 
it makes more sense to select a fixed value for s and use this value to determine 
the location of the £f contour. To do this, it must be realized that only values of 5 
in the convergence region of H(s) can be used. In this convergence region, all the 
s'-plane poles of G(s — s’) must be to the right of all the s’-plane poles of F(s’). This 
condition can be used to define the convergence region for H(s). The integration of 
Equation 9.77 is then evaluated by placing the £; contour between the s’-plane poles 
of F(s’) and G(s — s’). These ideas are demonstrated in the following example. 


£{ fo a} 


Example 9.1 The process of convolution along a Laplace contour can be demon- 
strated by a simple example. Let 


ce" t>0 
f(t) = ‘a 26) (9.80) 
and 
fem t>0 
g(t) = i p20" (9.81) 


The Laplace transforms of these functions are easily taken: 


F(S) = , (9.82) 
s—2 
with a convergence region given by s, > 2, and 
1 
G(s) = ——;, (9.83) 


s—3 
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with a convergence region s, > 3. If h(t) = f(2)g(t) then 


= Le" t>0 
h(t) = 0 1<0’ (9.84) 
and its Laplace transform can be taken directly 
i 
HA(s) = —z, (9.85) 
s-5 


with a convergence region s, > 5. 
This Laplace transform of A(t) and its convergence region can also be obtained 
using Equation 9.77 and the convolution of F(s) with G(s): 


oe PS ae 1 
HO = 39 J, 4 (z=) ere) se 


The integrand has two poles, one at s’ = 2 and another at s’ = s — 3, as shown in 
Figure 9.20. The £ contour must lie in between these poles as indicated in the figure. 
This contour can be closed to the left or to the right to produce the same result we 
obtained in Equation 9.85: 


1 
H(s) = ——z. (9.87) 
s—5 
The convolution arguments also give the same region of convergence for this trans- 
form. The pole at s — 3 must be to the right of the pole at 2, to make room for the £, 
contour. Consequently, s, — 3 > 2 and the convergence region for /7(s) is given by 
Sr > 5. 
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Figure 9.20 Convolution Integration in the Complex Plane 
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9.5.5 Summary of Transform Properties 


Some properties of the Laplace transform operation are summarized in the table 
below. In this table the double arrow < is used to indicate the reversible transform 
process. 


f(t) o F(s) 
g(t) ao Gs) 
ftt-t) e %F(s) 
df (t)/dt - —f(0)+sF(s) 
fO@®st) F(s) G(s) 
2mi f(t) g(t) - Fs) Gc, GS) 


9.6 THE LAPLACE TRANSFORM CIRCUIT 
As was the case with Fourier series and Fourier transforms, the inverse Laplace 


transform can be conceptually modeled with an additive electronic circuit. Consider 
the inverse Laplace transform integral 


fate s 
f= 5h i ds e"F(s), (9.88) 


which is an integration along the contour £, as shown in Figure 9.21. The Laplace 
contour must lie within the convergence region of the transform function which is to 
the right of all the poles of F(s). Along this contour, s = s,. + is;, ds = ids;, and s; 
ranges from —° to :. With these facts in mind, Equation 9.88 can be rewritten as 


+00 
f() = xf ds; ese F(s,. + is;). (9.89) 


If we think of the integral of Equation 9.89 in terms of its Riemann definition, it is 
the limit of a discrete sum of terms: 


ft) = lim, 7 Flr + in si) setae (9.90) 
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Figure 9.21 The Standard Laplace Contour 
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Equation 9.90 describes f(t) as a sum of functions, each with an exponentially 
growing envelope of (1/27)F(S,. + inAs;)Asj;e~', and oscillating as e4%". 


9.6.1 A Growing Exponential 


Consider the growing exponential 


_ fo t<0 
f= ie ae (9.91) 
with a > 0. Its Laplace transform is given by 
1 
F(s) = , (9.92) 


sS-a 


The summation on the RHS of Equation 9.90 can be modeled with the circuit shown 
in Figure 9.22. There is a fair amount of information in this figure. The insert in 
the upper right-hand corner shows the s-plane with its pole, the convergence region 
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Figure 9.22 The Basic Laplace Circuit 
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Figure 9.23 The Laplace Circuit with the Laplace Contour Shifted to the Right 


of the transform, and the inversion contour. Each term in the series on the RHS 
of Equation 9.90 is represented by an oscillator. In this figure only the first four 
oscillators for n 2 0 have been shown. All the oscillators have been going on for all 
time and continue to go on for all time. When added together they generate f(r), a 
function that is zero for all t < 0, and grows exponentially for t > 0. The exponential 
growth rate of f(t) must be less than e*”’, the growth rate of the individual oscillators. 
The £ contour can be moved to the right without affecting the result of the 
integration in Equation 9.89. This means we can increase s,,, making each oscillator 
have an arbitrarily large growth rate. Nevertheless, when they are all added up, they 
generate the same f(t). This somewhat surprising result is depicted in Figure 9.23. 


9.6.2 A Decaying Exponential 


Things get even more interesting when we consider the case where the single pole of 
F(s) lies on the negative real axis. This happens in Equation 9.91 when a < 0. The 
region of convergence for the Laplace transform integration still lies to the right of 
the pole at s = a, and the inversion contour £ can be placed anywhere to the right 
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Figure 9.24 The Laplace Circuit for an Exponentially Damped Function 


of this pole. Let’s start with it in the right half plane so that s,. > 0. The circuit for 
this situation is similar to the previous ones and is shown in Figure 9.24. The inserted 
figure shows the Laplace contour and region of convergence. The sum of function 
generators produces an f(t) that is zero for t < 0. For t > 0, even though each 
function generator produces a signal that diverges as t increases, the sum is a damped 
exponential. Think about that last sentence for a second. We are adding up an infinite 
number of exponentially growing, oscillating functions which miraculously add in 
just the right way to generate a smooth, decaying exponential. Very strange, indeed. 

Because the convergence region of this transform extends into the left half plane, 
we can move JL to the left, until it sits on top of the imaginary axis with s,, = 0. In 
this case, the Laplace inversion has effectively gone to a Fourier inversion and F(is;) 
is really just the Fourier transform of f(t). The Laplace circuit is now composed 
of signal generators oscillating at a constant amplitude for all time, as shown in 
Figure 8.6. 

Let’s keep shifting £ to the left, so that it lies in the left half-plane with s,. < 0. 
The Laplace circuit for this case is shown in Figure 9.25. Now the circuit is composed 
of signal generators whose amplitudes are decaying, because S,,. is less than zero. For 
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Figure 9.25 The Laplace Circuit with the Laplace Contour in the Left Half s-Plane 


t > O the signals from the generators sum to give the damped exponential of f(t). Its 
damping rate a is greater than the damping rate of the individual signal generators 
because a < 5,,. For t < 0, even though the amplitudes of the individual oscillators 
are diverging as ¢ goes to —°%, their sum produces a signal that is zero for allt < 0. 


9.7 DOUBLE-SIDED OR BILATERAL LAPLACE TRANSFORMS 


Early on, we made the statement that the Laplace transform is more general than the 
Fourier transform and can handle any function that the Fourier transform operation 
can. However, for the regular Laplace transform, f(t) must be zero for t < 0. The 
function 


_fem t>0 
fQ= oa p20? (9.93) 


where a and a are both positive, real numbers, is absolutely integrable and clearly 
possesses a legitimate Fourier transform. But it is not zero for t < 0, and therefore 
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does not have a regular Laplace transform. So how can we make the claim that any 
function which can be Fourier transformed can also be Laplace transformed? 

It is true that the regular Laplace transform, with the restriction that f(t) must be 
zero for ¢ < 0, cannot handle a function like the one in Equation 9.93. However, a 
small modification to the regular Laplace transform clears up this problem. 

Consider the Laplace transform type operation 


F(s) = [ dt e * f(t), (9.94) 


where ft < 0 is now included in the integration, and f(t) is given by Equation 9.93. 
The obvious question immediately arises. Is there is a region of convergence in the 
s-plane where the integral of Equation 9.94 exists? Inserting Equation 9.93 into 9.94 
gives 


0 7 
F(s) = | dte fem +f dt e feu" 
) 


—a 


0 oo 
elas e-tatst 


(9.95) 


The first term on the RHS of Equation 9.95 converges for s, < a, and the second 
term converges for s, > —a,. Therefore 


a) + a 


(s+ ays — a)’ 


with a convergence region given by —a, < s, < a. The function F(s) has first- 
order poles at s = a2 and s = —a, (remember that both a, and a are positive, real 
numbers). This convergence region is therefore a strip between the poles of F(s), as 
shown in Figure 9.26. 

It seems reasonable to try to set up the inversion integral along a Laplace contour 
in the convergence region as shown in Figure 9.26. This inversion integral becomes 


F(s) = (9.96) 


1 —e*(ay + ap) 


2m Sr 2 (s + ay)(s — a)” C2) 


~~~ Convergence region 


Figure 9.26 Convergence Region for a Double-Sided Laplace Transform 
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Figure 9.27 The Convergence Region for a Function that Grows Exponentially for t < 0 


Notice, there are poles of F(s) on either side of £. In order to close this contour, we 
have the same requirements as we did for the regular Laplace transform. Because of 
the e* term, we must close to the left for t > O and to the right for t < 0. After 
evaluating the residues of the poles, it is quite easy to see the contour integral of 
Equation 9.97 generates the original f(t) in Equation 9.93. The Laplace contour can 
be placed anywhere in the convergence region with the same result. If it is placed 
on the imaginary s-axis, this double-sided Laplace transform operation is equivalent, 
except for a 20 factor, to a Fourier transform. 

If we let a, or a@2 become negative, the standard Fourier transform will fail. But, as 
long as a2 > —ay, there will be a finite convergence region and this special Laplace 
transform will exist. In cases like this, the convergence region does not include the 
imaginary s-axis, consistent with the fact that the Fourier transform operation fails. 
An example of this, with a. < 0, is shown in Figure 9.27. 

Therefore, this more general approach to the Laplace transform, which is called a 
double-sided or bilateral Laplace transform, includes all the properties of the Fourier 
transform. To use this approach we must take more care specifying the Laplace 
contour and the region of convergence. This is important, as you will see in the 
example below, because a given F(s) can generate different f(t)’s depending on how 
the convergence region is defined. 


Example 9.2. Consider the transform given by Equation 9.96 with the specific 
values a; = a2 = 1: 


2 
F(s) = --————_—-. 98 
ee (s+ I(s- 1) oe 
This function has simple poles at s = +1 and s = -1. There are three possible 


regions of convergence: s, > 1; -1 < s, < 1; ands, < —1. Laplace inversions 
using these three different convergence regions result in three different functions of f. 

For the first case, with the region of convergence given by s, > 1, the Laplace 
contour is to the right of all the poles, as shown in Figure 9.28. This is just a regular 
Laplace transform. Because there are no poles to the right of the Laplace contour f(t) 
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Ue 


Figure 9.28 Inversion for the Region of Convergence Given by Real 5 > 1 


is zero for t < 0. Fort > 0, f(t) is given by the sum of the residues of the two poles: 


fit) = ee me Ses (9.99) 


as shown in Figure 9.28. This f(t) has no Fourier transform analogue, because the 
region of convergence does not include the imaginary s-axis. 

The second region of convergence, given by —1 < s, < 1, is the region between 
the two poles, as shown in Figure 9.29. This is a specific case of the situation we 
discussed previously. The Laplace contour lies between the two poles, and now f(t) 
is no longer zero for t < 0, but is given by minus the residue of the pole at s = 1. 


Similarly, f(t) for tf > 0 is nonzero and equals the residue of the pole at s = —1, 
_ fet t<0 
f= a t>0”’ (9.100) 


as shown in Figure 9.29. In this case the imaginary s-axis is in the region of conver- 
gence, so there is a corresponding Fourier transform. 

The last region of convergence has s, < —1, and is shown in Figure 9.30. In this 
case, the contour lies to the left of all the poles, and consequently f(r) is zero for 
t > 0. For t < 0, we sum the negative of the residues of the poles to get, 


ft) = ‘> he “Eee (9.101) 


Figure 9.29 Inversion for the Convergence Region Given by ~1 < Real s < 1 
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f(t) 


“@e 


Figure 9.30 Inversion for the Convergence Region Given by —1 < Real s 


as shown in Figure 9.30. This function does not possess a Fourier transform, because 
the imaginary s-axis is not in the region of convergence. 


EXERCISES FOR CHAPTER 9 


1. Find the Laplace transform of the function 


_ fe sin (wot) t>0 
hid re t<0’ 


where a, and w, are positive real numbers. Demonstrate that you have obtained 
the correct transform by inverting it to obtain the original f(t). 


2. Consider the function 
t>-l 


_ e 


Sketch f(t) vs. f and determine its Laplace transform and region of convergence. 
Confirm that you have the right transform by inverting it to obtain the original 


f(t). 
3. Show that the Laplace transform of 


in (ki >0 
ee 


2ks 


F(s) = @rRY Spe 


Determine the region of convergence for F(s). 


4. Find the Laplace transform of the function 


_ fre t>0 
r= 4 t<0 
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(a) using the standard transform integral and integrating by parts the appropriate 
number of times. 
(b) by treating the function as a product of two other functions. 
5. Determine the Laplace transforms of the following functions. Assume the func- 
tions are zero for t < 0, and for ¢ > 0 are given by: 
(a) sinh (ke). 
(b) cosh (kt). 
(c) sin (kt). 
(d) cos (kf). 
(e) e' cost 
(f) 2”. 


6. Find the inverse Laplace transform of 
Ss 
F(s) = Gry 
with the region of convergence s, > 0. Make a sketch of f(s). 
7. Find the function that has the Laplace transform 
Es) = —. 
(s— 1 
with the convergence region s, > 1. 
8. Find the inverse transform of 


] 


where k is a real positive number 

(a) by direct integration of F(s). 

(b) by factoring F(s) and using the convolution theorem. 
(c) by expanding F(s) as a sum of partial fractions. 


9. Find the inverse Laplace transform of 


1 


F(s) = s(s — k)?” 


where k is a positive real number 

(a) by direct integration of F(s). 

(b) by factoring F(s) and using the convolution theorem. 
(c) by expanding F(s) as a sum of partial fractions. 
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10. Find the inverse Laplace transform of 


2 


F(s) = @ +B) 


(a) by direct integration of F(s). 
(b) by factoring F(s) and using the convolution theorem. 
(c) by expanding F(s) as a sum of partial fractions. 


11. Show that 


afte 
dt 


y 


t=0 


: nee 


a | =F LLf@)1 — 5 fO) - 


where L[f(t)] = F(s). Develop a general expression for 
d" f(t 
29) 


at" 
12. If 
_ ds) 
G(s) = rae 
what is g(t), the inverse transform of G(s), in terms of f(t), the inverse transform 
of F(s)? 


13. If the function 


nd 


Eo = z— 


is treated as a bilateral Laplace transform, it can have three different regions of 
convergence: 

(a) sp < —-1. 

(b) -l<s, <1. 

(c) 1<-s,. 

For each of these convergence regions, the inversion of F(s) gives a different 
function of time. Determine these three functions and make a plot of them. 


14. Find the Laplace transform of 
_ jo t>—-l 
po= {0 t<-l- 
15. Find the inverse Laplace transform of 


Ss 
£O = aoa 


assuming that the region of convergence is —1 <5, < +1. 
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16. Find the Laplace transform of f(t) where 


0 t>0 
Sin (@,t) t<0 


f®= { 


and identify its region of convergence in the s-plane. Invert this transform using 
closure to obtain the original f(s). 


17. Consider the function 


at t>0 


fl) = ie Gs 


where a and are both positive, real numbers. 

(a) Make a sketch of this function. 

(b) Why does this function not have a standard Laplace transform? 

(c) Define a transform pair that will work for this function. Discuss the condi- 


tions on a and f that allow a finite region of convergence for the transform 
function. 


10 


DIFFERENTIAL EQUATIONS 


A differential equation contains terms involving either total (d/dx) or partial (d/dx) 
differential operations. In this chapter, several methods for obtaining solutions to 
differential equations will be presented. The emphasis is on solutions to linear prob- 
lems, although some techniques for solving nonlinear equations will be discussed. 
Particular attention will be paid to Fourier and Laplace transform approaches and 
Green’s function solutions. A basic knowledge of differential equations is assumed. 


10.1 TERMINOLOGY 


The study of differential equations involves a large number of definitions. This section 
presents a quick review of some of these definitions and other important terminology. 


10.1.1 Dependent versus Independent Variables 


Two types of variables appear in differential equations. A dependent variable repre- 
sents the unknown solution to the equation and typically appears in the numerator of 
the differential operation(s). An independent variable is a parameter of the solution. 
The dependent variables are functions of the independent variables. An independent 
variable typically appears in the denominator of the differential operation(s). For 
example, in the differential equation 


dy(x) 
dx 


x is an independent variable and y is a dependent variable. The dependence of y on 
x is indicated by writing y(x). 


= y(x) + 5, (10.1) 
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10.1.2 Ordinary versus Partial Differential Equations 


Ordinary differential equations involve only one independent variable and contain 
only total differential operations. For example, 
dy(x) 
dx 
is a typical, ordinary differential equation. There is a single dependent variable y, 
which depends on the single independent variable x. 
On the other hand, in a partial differential equation the dependent variable is 
a function of more than one independent variable. Consequently, these equations 
contain partial differential operations. For example, 
Fy(x, 0) 52% t) 
Ox ot 
is a partial differential equation. It involves one dependent variable y, which depends 
on the two independent variables x and t. This dependence is emphasized by writing 
y(x, t). 


+ yx) = 0 (10.2) 


+ y(x,t) =0 (10.3) 


10.1.3 Linear versus Nonlinear Equations 


A linear differential equation contains only terms that are linear in the dependent 
variables or their derivatives. In a linear equation, if y is a dependent variable, there 
can be no y”, y(dy/dt), (dy/dt), etc. terms, nor terms where the dependent variable 
appears in the argument of a nonlinear function. There are no restrictions on the 
independent variables. Equations 10.2 and 10.3 are examples of linear differential 
equations. The equation 
BOs x*y(x) = sin(x) (10.4) 
dx 

is also a linear, even though it contains an x* factor and a sin(x) term. 

A nonlinear differential equation contains dependent variable products or nonlin- 
ear functions of the dependent variable. For example, 


d[y)]_, [ayo]. | 
ete | 7, | + ye) = 0 (10.5) 
and 
Dy) 4: x’ sin(y) = 0 (10.6) 
dx 


are nonlinear differential equations. The first is nonlinear because y(x) is squared in 
the numerator of the differential operation and because of the (dy/dx)* term. The 
second is nonlinear because the dependent variable appears inside the nonlinear sine 
operation. 

Except in rare cases, nonlinear differential equations are much more difficult 
to solve than linear ones. Many times nonlinear equations require numerical or 
approximate solutions. 
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10.1.4 Order 


The order of a differential equation, regardless of whether it involves partial or total 
differential operations, is determined by the highest derivative present in the equation. 
Equations 10.2 and 10.3 are examples of first-order differential equations, and 


d*y(x) , dy) _ 
dx3 dx 


is an example of a third-order differential equation. 


y(x) = 0 (10.7) 


10.1.5 Coupled Differential Equations 


Coupled differential equations involve more than one dependent variable. For ex- 
ample, 


d2 

20 = ni) - no (10.8) 
d2 

220) = xx) — 11 (10.9) 


form a set of coupled, second-order, linear differential equations. Both x; and x2 are 
dependent variables and are functions of the single independent variable t. 


10.1.6 Homogeneous versus Nonhomogeneous Equations 


We will restrict our comments on homogeneous and nonhomogeneous properties to 
linear equations. A linear differential equation is homogeneous if every term contains 
the dependent variable. Homogeneous equations are usually arranged so that all the 
terms are on the LHS. Equation 10.7 is one example of a homogeneous equation. In 
general, an ordinary homogeneous differential equation can always be placed in the 
form 


Lopy(x) = 9, (10.10) 


where £,, is a linear differential operator, which acts on the dependent variable y. 
The operator for Equation 10.7 would be 


=—+—-l. (10.11) 


Solutions to homogeneous equations are sometimes called “undriven” solutions, 
because they can describe the response of a system which is not under the influence 
of an external force. The solutions of homogeneous equations are not unique, but can 
always be scaled by an arbitrary constant. A homogeneous equation will always have 
a trivial solution of y(x) = 0. 

The superposition principle applies to solutions of linear, homogeneous equations. 
It states that if y,(x) and y2(x) are solutions to a linear, homogeneous differential 
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equation, then a linear sum of these solutions, y(x) = Ay yi(x) + Bo ya(x) where A, 
and B, are constants, is also a solution to the same differential equation. 

The order of a linear, homogeneous differential equation is especially important. 
It is equal to the number of nontrivial (nonzero) solutions to the equation, and 
consequently to the number of boundary conditions necessary to make the solution 
unique. 

In contrast, nonhomogeneous differential equations contain at least one term with- 
out a dependent variable. They are sometimes referred to as “driven” equations, be- 
cause they describe the response of a system to an external signal. Nonhomogeneous 
equations are usually arranged so that the terms that contain the dependent variable(s) 
are all on the left, and all other terms are on the right. Using the operator notation, an 
ordinary, linear, nonhomogeneous equation can be written 


Lopy(x) = f(x), (10.12) 
where f(x) is often called the driving function. The equation 
d2 
oS + (x) = sinx (10.13) 
dx 


is an example of a linear, nonhomogeneous equation. 
The general solution to a linear nonhomogeneous differential equation £,,y(x) = 
F(x) can always be written as the sum of two parts 


y(x) = ya(x) + yp(x). (10.14) 


The first term, called the homogeneous part, is the general solution of the corre- 
sponding homogeneous equation £,,y(x) = 0. The second part, called the particular 
solution, is any solution of the nonhomogeneous equation. 


10.2 SOLUTIONS FOR FIRST-ORDER EQUATIONS 


Many first-order equations are relatively easy to solve because the solutions can be 
obtained simply by integrating the differential equations once. However, there are 
some first-order equations that are not immediately in integrable form. In this section, 
three standard approaches for these types of problems are presented: the separation of 
variables, the formation of an exact differential, and the use of an integrating factor. 


10.2.1 Separation of Variables 
Any first-order differential equation that can be placed in the form 
dy(x)__ Pa) 
dx Qy) 


may be able to be solved using the method of separation of variables. The minus sign 
in this equation is an arbitrary convention. The differential equation may be nonlinear 
through the Q(y) term. 


(10.15) 
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Rewrite Equation 10.15 as 
Q(y) dy = —P(x) dx. (10.16) 


Now if both sides of Equation 10.16 can be integrated, we can generate a functional 
relationship between x and y, with one arbitrary constant of integration. In physi- 
cal problems, that constant is determined by one boundary condition that must be 
specified to make the solution unique. 


Example 10.1. The nonlinear differential equation 


dy = cos(x) 
dx y 


, (10.17) 


with the boundary condition of y(0) = 0, can be solved by separation of variables. 
Equation 10.17 leads to the integral equation 


[oy = fu cos(x). (10.18) 
Both sides of Equation 10.18 easily integrate to give 
y? 
a + C, = sin(x), (10.19) 


where the two arbitrary constants of integration have been combined into the single 
constant C,. Imposing the boundary condition y(0) = 0 forces C, = 0. Thus we 
have the final solution 


y = V/2sin(x). (10.20) 


10.2.2 Forming an Exact Differential 


If itis not possible to separate variables, an approach that sometimes works is to form 
an exact differential. Try to set up the equation in the form 


dy(x) _ _ Ry) 


: 10.21 
dx ~  S(y) es 
Equation 10.21 can immediately be manipulated to read 
R(x, y)dx + S(x,y)dy = 0. (10.22) 


This equation is in the form of an exact differential if there is a function @(x, y) such 
that 


IPE) = Ree) (10.23) 
Ox 
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and 


“Y = S(x,y). (10.24) 


If such a #(x, y) exists, the LHS of Equation 10.22 can be written as 


R(x, y)dx + S(x, y)dy = 200.9) cre 29) rn 


= dd, (10.25) 
and Equation 10.22 becomes 
dd = 0. (10.26) 


The solution to Equation 10.21, the original differential equation, is therefore 
(x, y) = C,, where C, is some constant. As before, this constant is determined 
by the boundary condition for the problem. 

There is a simple test to see if a pair of functions forms an exact differential. If 
R(x, y) and S(x, y) form an exact differential, Equations 10.23 and 10.24 must be 
satisfied. If x and y are independent variables and the partial derivatives of (x, y) 
exist, then 


(x,y) _ d(x, y) 


(10.27) 
Oxdy OYOX 


This means that for R(x, y) and S(x, y) to form an exact differential, they must satisfy 
the condition 


OR(x,y) _ 9S(x, y) 


5 re (10.28) 
Example 10.2 Consider the differential equation 
24. 542 

Gy 8 ee ey (10.29) 


dx 4xy 


with the boundary condition y(1) = 1. Applying the test given by Equation 10.28 to 
the functions on the RHS of Equation 10.29 shows that an exact differential may be 
found for this problem. Indeed, the function 


(x, y) = x? + 2xy? (10.30) 


generates the two functions 


= 3x? + 2y? 


R¢x, 9) = PE) 


S(x,y) = ony) = Axy. (10.31) 
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The solution to Equation 10.29 is therefore given by 
x? + Ixy? = Cp. (10.32) 
The boundary condition y(1) = 1 determines C, = 3, and our final result is 


3-x3 
2x - 


ee (10.33) 


10.2.3. Integrating Factors 


Integrating factors can sometimes be used to solve linear, nonhomogeneous, first- 
order differential equations of the form 


dy(x) 
dx 


+ P(x) y(x) = G(x). (10.34) 


In this equation, P(x) and G(x) are known functions of the independent variable x. 
An integrating factor is a function a(x), by which we multiply both sides of Equation 
10.34, so the LHS is placed in the form 


d 
le y(x)]. 


In other words, we seek an a(x) such that 


dy(x d 
a(x)| 2 + Paya] = Zlaw) vo. (10.35) 
dx dx 
Using the product rule of differentiation on the RHS of Equation 10.35 gives a simple 
first-order differential equation for a(x): 
da(x) 


a(x) P(x) = a (10.36) 


This equation can be rearranged as 
— = P(x)dx (10.37) 


and solved for a(x) as follows: 


Ina = [aspe (10.38) 


a(x) = ef PO), 
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Notice, it must be possible to calculate the integral of P(x) in order to use this method. 
It is not necessary to worry about the constants of integration associated with the two 
integrations performed in the steps above. The original differential equation we are 
trying to solve is first order, and therefore needs only one arbitrary constant. This 
constant will be introduced in the subsequent steps. 

With a(x) determined, the solution to the original differential equation can now 
be obtained. Multiplying both sides of Equation 10.34 by a(x) and using Equation 
10.35 gives 


d 
Fg le) YOO] = aa) G(X). (10.39) 
Ix 
Integrating both sides results in the expression 
a(x)y(x) = / dx a(x)G(x) + Co. (10.40) 


The integration constant C, is determined by the single boundary condition that is 
necessary to make the solution unique. The final result is 


é f dx a(x)G(x) + Cy 
a(x) / 


yx) (10.41) 


Notice that to use this process, not only must we be able to integrate the function 
P(x) to get a(x), but we must also be able to integrate the function a(x)G(zx). 


Example 10.3. As a simple example of using an integrating factor, consider the 
differential equation 


— + y= e*, (10.42) 


with the boundary condition y(0) = 1. The integration factor is simply 
a(x) = ef ® = @*, (10.43) 


Substitution into Equation 10.41 gives 


e~+ Coe”. (10.44) 


The boundary condition requires C, = 1/2, so 


y(x) = cosh(x). (10.45) 
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10.3 TECHNIQUES FOR SECOND-ORDER EQUATIONS 


Second-order differential equations pervade physics and engineering problems. One 
of the most common equation in physics is the second-order equation 


dit Seago (10.46) 


which describes the motion of a simple harmonic oscillator with a period of 27/wy. 
As mentioned earlier, the number of nontrivial, independent solutions of a linear 
differential equation is equal to the order of the equation. Thus, a linear second-order 
equation should have two different solutions. We can see that this is the case for 
Equation 10.46, since both 


y(x) = sin(w,x) (10.47) 
y(X) = COS(Wx) (10.48) 


are valid solutions. 

There is no general analytic method for finding solutions to second-order and 
higher equations. Solutions are often obtained by educated guessing, or numerical 
methods. There is a general technique, however, for a specialized class of equations 
which have constant coefficients. In this section, we describe a simple technique 
for solving ordinary, second-order, linear differential equations with constant co- 
efficients. Then we introduce an important quantity called the Wronskian, which 
provides a powerful method for generating a second solution to a homogeneous dif- 
ferential equation when one solution is known. The Wronskian can also help generate 
the particular solution to a nonhomogeneous equation, if the general homogeneous 
solution is already known. 


10.3.1 Ordinary, Homogeneous Equations with Constant Coefficients 


The most general form of a second-order, linear, ordinary, homogeneous differential 
equation can be written as 

d’y(x) dy(x) 

a + PQ.) + Q(x) y(x) = 0. (10.49) 
Equation 10.49 is said to have constant coefficients, if the functions P(x) and Q(x) 
are constants, 


d*y(x) _ , dy) 
P 
dx? dx 
where the factor of —2 was used in the second term to simplify the algebra to come. 


The manipulation of the linear equations is assisted by defining the differential 
operator of order n as Di, = d” /dx". 1n this notation, Equation 10.50 becomes 


+ qy(x) = 0, (10.50) 


(Dep — 2PDop + q)y(x) = 0. (10.51) 
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We can now perform a fancy trick, and factor this as if it were a quadratic algebraic 
expression, 


(Dop — r+) (Dop — r—)y(x) = 9, (10.52) 
) 
s(x, 


where r+ = p + /p* — q. This expression is completely equivalent to Equation 
10.51. 

This can be solved in two steps. First solve for s(x), the bracketed term in Equation 
10.52, 


(Dop — r+)s(x) = 0. (10.53) 
This is a simple first-order equation with the solution 
s(x) = Ae’, (10.54) 


where A is our first undetermined constant. The second step is to solve the remainder 
of Equation 10.52: 


(Dop — r—)y(x) = s(x) = Ae’**. (10.55) 


This is another first-order equation. Its solution can be obtained by using an integration 
factor of e~’-*, with the result 


y(x) = e”* ls +A if dx | , (10.56) 


where we have left the integral indefinite and introduced B, our second undetermined 
constant. 

At this point, there are two cases to consider. First, if Pp’ = q, the two “roots” are 
equal: r_ = ri = p. In this case, Equation 10.56 becomes 


y(x) = (Ax + B)e”* for p* = q. (10.57) 
On the other hand, if p* # q, then we get 
y(x) = Ae’** + Be™* for p # q’. (10.58) 
In both cases, there are two arbitrary constants, which are determined by the boundary 
conditions of the problem. 
“What if the roots are imaginary?” we hear you asking. In other words, what 
happens if p? < q, forcing r+ and r_ to be complex? It turns out Equation 10.58 


is still entirely valid, if you remember the compiex definition of the exponential 
function: 


e**)) = @*[cos y + isin y). (10.59) 


If p*? < q, r+ and r— will always be complex conjugates. In this case, define r. = 
r, = ir; where r, is the real part of r,, and r; is the imaginary part of r+. Using this 
definition and redefining the arbitrary constants converts Equation 10.58 into 


y(x) = e”* [A cos(r:x) + Bsin(r;x)] for p* < gq. (10.60) 
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Example 10.4 Consider the classic example of the harmonic oscillator, governed 
by the second-order equation 


Sh ? y(t). (10.61) 
In operator notation, this equation is represented by 
(Dz, + w2)y(t) = 0. (10.62) 


Referring to Equation 10.50, p = 0 and q = @?. The roots associated with this 
equation are r+ = +iw,. When substituted into Equation 10.60, the general solution 
for the simple harmonic oscillator becomes 


y(t) = Acos(w,t) + B sin(w,f). (10.63) 


10.3.2 Finding a Second Solution Using the Wronskian 


The Wronskian is a useful tool for generating the second solution of a second- 
order, linear, homogeneous equation when the first solution is already known. The 
Wronskian, itself, is a function of the independent variable and can be determined 
directly from the differential equation. The Wronskian for higher-order equations is 
discussed in a later section of this chapter 

As we pointed out earlier, the most general form of a second-order, linear, ordinary, 
homogeneous differential equation can be written as 


d? d 


Because this is a second-order, linear equation, there will be two nontrivial solutions, 
y,(x) and y2(x). Any linear combination of these solutions 


y(x) = Cryi(x) + Cyyo(x), (10.65) 


where C, and C) are arbitrary constants, is also a valid solution. 
The Wronskian W(x) is defined in terms of these two, nontrivial solutions as 


W(x) = yx) | — yo(x) bea 
a dx 
= yy (x) yo'(x) — yo(x) y1'(x), (10.66) 


where we have used the shorthand notation y'(x) = dy/dx to represent derivatives. 
The Wronskian can also be written using a determinant: 


wos) = | yi(x) — y2(x) (10.67) 


yi'(x)  y2"(x) 


det 


350 DIFFERENTIAL EQUATIONS 
The Wronskian can be used to tell whether the y,(x) and y2(x) solutions are 
independent. If two functions are linearly dependent, we can write 
yi(x) = ke ya(x) (10.68) 
for some choice of the constant k,. Taking the derivative of both sides gives 
yr) = ke y2"(x). (10.69) 


Consequently, if the two solutions are linearly dependent, 


yilx) _ yi’) 
—— = (10.70) 
ya(x) —-ya/{x) 
and W(x) = 0. 
As a simple example, return to the harmonic oscillator equation 
d7y(x 
i ) + w2y(x) = 0. (10.71) 


We already have shown that the two nontrivial solutions are 


yi(x) = sin(@,x) (10.72) 
y2(x) = cos(w,x). (10.73) 
The Wronskian in this case is 
_ sin(@,x) cos(@,x) 
WONT @p COS(WoX) = — Wy SiN( WX) a 
= ~ Wo. (10.74) 


Because the Wronskian for this differential equation is not zero, the two solutions 
are linearly independent. 

An important feature of the Wronskian is that it can be obtained directly from the 
differential equation, without knowing either of the solutions. Consider the general 
second-order differential equation given by Equation 10.64. Start with W(x), as 
defined by Equation 10.66, and take its derivative 

W'(x) = yi yo" + yi'y2' ~ yo’! — yay” 
= yi yo” — yoy”. (10.75) 


Both y,(x) and y2(x) are unknown functions, which separately satisfy the differential 
equation. The next step involves writing the pair of equations 


yi(~)Ly2” + P(x)y2' + O(~)y2] = 0 (10.76) 
and 


ya(x)In" + Px)’ + O@)yi] = 0, (10.77) 
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which are generated by plugging the two solutions into Equation 10.64. Subtracting 
these two equations gives 


yi yo" — yoy" + P(e) [yi ye’ — y2yi1'] = 0. (10.78) 


Notice the dependence on Q(x) has conveniently canceled out. Equation 10.78 can 
now be written in terms of the Wronskian and its derivative, 

dw 

aw) + P(x) W(x) = 0. (10.79) 

dx 

So, without any knowledge of the solutions to the original differential equation, a 
simple first-order equation for the Wronskian has been derived. The solution for W(x) 
can be obtained by separation of variables: 


W(x) dW ‘a 
—_— = - dx P(x) 
i, Ww Xo 


W(x) = Wy em Leo xP), (10.80) 


where W, is a constant, equal to W(x,). From this expression it is evident that if W(x) 
is zero for some value of x, it will be zero for all x. 

This ability to determine W(x) directly from the differential equation is important, 
because we can use it to generate the second solution of a differential equation if one 
solution is already known. To see this, assume that y;(x) is known and y,(x) is not. 
This situation might arise when it is possible to guess one solution. Start by forming 
the derivative of y2(x) divided by y,(x): 


d | _ Yi) y2'(x) = yo(x) yi!) (10.81) 
dx | yi(x) yr) 
Next express this derivative in terms of the Wronskian: 
d ui) W(x) 
eee = : 10.82 
dx 25 yi) ‘ } 
Integrating both sides with respect to x gives 
WO) | 
= d ; 10.83 
tx) = yx) f ax Fes (10.83) 


10.3.3 The Wronskian and Nonhomogeneous Equations 


The Wronskian can also be used to obtain the particular solution to a second-order, 
linear, nonhomogeneous differential equation. Consider a general, linear, nonhomo- 
geneous equation with the form: 


d?y(x) dy(x) 


a(x) de + d(x) + c(x) y(x) = d(x). (10.84) 
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The functions, a(x), b(x), c(x), and d(x) are known functions of the single independent 
variable. The general solution to Equation 10.84 can always be written as 


y(x) = yp(x) + Cyyi(x) + Cry2(x), (10.85) 


where y,(x) and y2(x) are the two independent solutions to the corresponding second- 
order homogeneous equation, that is Equation 10.84 with d(x) = 0. The function, 
p(x), is called the particular solution and is any one solution to the nonhomogeneous 
Equation 10.84. The constants C, and C, are determined by the boundary conditions. 
There must be two such conditions to determine these constants and uniquely specify 
the problem. 

We will now show that if a(x), b(x), c(x), d(x), and the two nontrivial, homoge- 
neous solutions y,(x) and y2(x) are known, the general solution represented by y(x) 
in Equation 10.85 can be obtained using the Wronskian. Actually, only one of the 
homogeneous solutions really needs to be given at the start, because Equation 10.83 
can be used to determine the second. The Wronskian can be obtained using either 
Equation 10.66 and the two homogeneous solutions, or from Equation 10.80 with 
P(x) = b(x)/a(x). 

It is convenient to reformat the problem. Instead of attempting to find the solution 
yp(x) and adding it to the two homogeneous solutions, we will try to find y(x), the 
function given by Equation 10.85, assuming it can be written as 


y(x) = Ci) yi (x) + Co(x) yor). (10.86) 


We will seek solutions for the two functions, C;(x) and C2(x). An important thing 
to notice right from the start is that C,(x) and C(x) are not unique. This is quickly 
evident if you assume the y,(x) of Equation 10.85 is known. On the one hand, 
C,(x) could equal the constant C), in which case C2(x) = C2 + yp(x)/y2(x). On 
the other hand, we could have just as easily made C(x) the constant C2, forcing 
C(x) = C + y,(x)/y1(x). The fact that these functions are not unique will play an 
important role in the discussion that follows. 

Insert Equation 10.86 into the Equation 10.84. To do this, two derivatives of y(x) 
must be calculated. Using Equation 10.86, the first derivative is given by 


dy(x) 
dx 


Because C(x) and C(x) are not unique, we are free to impose a relationship between 
them, to make them unique. This relationship will be chosen to simplify Equation 
10.87. In mathematics, such a relationship is often referred to as a gauge. In this 
particular case, a useful gauge is the requirement that 


= Cy(x) vie) + Cox) y2'(x) + Cre) vie) + C2"(x) y2(x). (10.87) 


Cy'(x) yi(x) + Co'(x) yo(x) = 0. (10.88) 
This simplifies Equation 10.87 to 


dy(x) 


eas C(x) yi'(x) + C2(x) y2'(x). (10.89) 
x 
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The second derivative of y(x) becomes 
dy(x) 
dx? 
Equations 10.86, 10.89 and 10.90 can now be substituted into Equation 10.84. Taking 
into account that y,(x) and y2(x) both satisfy the homogeneous equation with d(x) = 
0, a simple first-order differential equation involving C(x) and C(x) is obtained: 
a(x) [Cy'(x) yy"(x) + C2'(x) y2"(x)] = d(x). (10.91) 


This equation, plus the gauge given by Equation 10.88, results in a pair of coupled 
differential equations that determine C, (x) and C2(x) to within two arbitrary constants: 


= Ci(x) yi "(x) + Cox) y2"(x) + Cr'@)y'(a) + Cy'Cya'(x). (10.90) 


d 
Coy) + G'@yr'@ = £2 (10.92) 
a(x) 
Cy'(~)yi (x) + C2'(~)y2(x) = 0. (10.93) 


To solve these equations, they must be uncoupled. Equation 10.93 can be used to 
solve for C>/(x): 


Cr(x) = —C)'@) E a . (10.94) 
y2(x) 
Now substitute this back into Equation 10.92 to obtain 
d 
C1'(x) [yi @&) yo(x) — yr) y2'Q)] = noo. (10.95) 


The term in brackets is — W(x), the negative of the Wronskian. Thus we get the two 
relations, 


dC\(x) _ __ y2(x) d(x) 


dx W(x)a(x) ae 
dC2(x) _ 4 MG)dG) (10.97) 
dx W(x) a(x) 
Integrating these two equations gives 

= y2(x) d(x) 
C(x) = [os eva + aconstant (10.98) 

= yi(x) d(x) 
Cox) = + i dx Eero | + another constant. (10.99) 


These expressions can now be used to generate the solution to the nonhomogeneous 
equation: 
y(x) = C(x) vie) + Co(X) yr(2). (10.100) 


If the constants in Equations 10.98 and 10.99 are associated with the C, and C> of 
Equation 10.85, the particular solution can be written 


ae yo(x) d(x) yi (x) d(x) 
yp(x) = yay | de Fara + ata) fax Eera : (10.101) 
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10.3.4 Higher-Order Wronskians 


The second-order Wronskian, introduced above, can be generalized to higher-order 
differential equations by extending the determinant definition given by Equation 
10.67. Unfortunately, these higher-order Wronskian are not very useful for generating 
new solutions, because the math gets very complex. They are still of value, however, 
for determining if solutions are linearly independent. 

An n™ order Wronskian can be generated from the determinant 


yi(x) —-yolx) ws Yn) 
yilx) yo") + yn'(X) 
Wx) =| 0") ya") yn) . (10.102) 
yee) yo) + yn) | ee 


Here the y;(x) are the n solutions of a linear n™ order, homogeneous differential 
equation. As before, if W(x) # 0, the y;(x) solutions are linearly independent. 


10.4 THE METHOD OF FROBENIUS 


The method of Frobenius is a series approach to solving linear, homogeneous differ- 
ential equations of any order. It is usually a method of last resort, because it involves 
a good deal of algebra and many times leaves the solution as an infinite series that 
cannot be placed in closed form. However, the method can be used to obtain solutions 
to many equations that can be solved in no other way. 
The technique is probably best described by example. Consider the second-order 
differential equation 
2 dy(x) 
x ax + 
This is a form of Bessel’s equation, which is common in physics and engineering 


problems with cylindrical geometries. We will try to find a solution using a power 
series of the form 


2x oe + (x2 — 2)y(x) = 0. (10.103) 


y(x) = So enx"”, (10.104) 


n=0 


where the c, are unknown coefficients, and s is a currently unknown integer, either 
positive or negative, which determines the starting power of the series. 
When the RHS of Equation 10.104 is inserted into Equation 10.103, we obtain 


x? So en(n + s)\(n + — 1)x"*8-? + 2x > Ca(n + s)x"*57! 


n=0 n=o0 


+ - 2) So en x"*5=0 (10.105) 


n=0 
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where we have differentiated the series for y(x) term by term. This approach is valid 
if the resulting series converges uniformly, a condition that must be checked once s 
and the c, are determined. 

The terms can now be grouped based on the powers of x they contain: 


eat [(n + s\n + 5 — 1) + An + 8) — II x"*5 + ae =0. (10.106) 
n=0 


Equation 10.106 is of the form 


SS dmx” =0. (10.107) 


The only way the LHS of this equation can be zero for all x is to have all the a,, 
coefficients equal to zero. To isolate the various powers of x and their coefficients, 
rewrite Equation 10.106 as 


[o'e) 


Soen [(n + s\n + 5 — 1) + 2(n + 8) — 2] x" + So cg x"** = 0. (10.108) 


n=0 n=2 
The first series in Equation 10.108 has powers of x starting with x°, ie., x°, xt}, 
x5*2,__.. The second sum has powers of x starting with x°*?, ie., x°*?, x°+3,.... So, 


in order to group according to powers of x, the first two terms of the first summation 
in Equation 10.108 have to be written out explicitly. The remaining terms can be 
written as a sum from n = 2 ton = ©; 


Co (ss — 1) + 2s — 2] x* + ey [(s + 1S) + 28 + 1) — 2) 7! 


+ d {eo [(s +ny(st+n-1)+2(st+n)- 2] + alate 


n=2 


=0. (10.109) 


Each of the terms in Equation 10.109 must be equal to zero. The coefficients of 
the x° and the x*t! terms form what are called the indicial equations: 


Co [(s\(s —-1)4+2s- 2| =0 (10.110) 
ey [(s + 1)(s) + 2 + 1) — 2] = 0. (10.111) 


These equations determine s. If Equation 10.104 is to generate a nontrivial solution, 
the c,, coefficients cannot all be zero. There are several approaches that all generate 


the same solution. We will start off by assuming that c, # 0. That means in order to 
satisfy Equation 10.110, 


(sis — 1) + 2s-2=0. (10.112) 
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This leads to two solutions for s: s = 1 and s = ~—2. These two different values for s 
mean that Equation 10.104 really represents two solutions, 


yi(x) = SP ann fors = 1 (10.113) 
n=0 

y2(x) = So bn x? for s = —2, (10.114) 
n=0 


where the c,, have been replaced by a,, and b,, to distinguish between the two solutions. 
The fact that there are two nontrivial solutions is no surprise since we are dealing with 
a second-order differential equation. Notice that the order of the indicial equations 
determined the number of solutions. 

Forcing s = 1 makes Equation 10.110 true for any nonzero value of ay. In the 
Same manner, when s = —2, Equation 10.110 holds for a nonzero value of b,. With 
5 = ], the only way Equation 10.111 can be true is if a, = 0. Likewise, if s = —2, 
the only way Equation 10.111 works is if b; = 0. Thus the indicial equations show 
that there are two solutions. One starts off with dg # 0 and a, = 0. The second has 
by # 0 and b; = 0. Notice we could have satisfied Equations 10.110 and 10.111 
with co = O and c; # 0. This would have led to two different values for s, i.e., s = 0 
and s = —3. It is not hard to see that the solutions generated with this choice are 
identical to the solutions we will obtain below. 

What remains now is to determine the rest of the a, and b, coefficients. They are 
obtained from the n = 2 terms of Equation 10.109. For s = 1 and n = 2 we obtain 


a,|(n + 1)(n) + 2(n + 1) — 2] + ag-2 = 0. (10.115) 


This forms a generating equation for the a,: 


Qn-2 


=e => 
an aan n=2. (10.116) 
A similar argument for the b, with s = —2 gives 
_ b,-2 
by, = ae ae or n=2. (10.117) 


Evaluating the first five coefficients for each series gives 


a = a bo = bo 

Qi = 0 b, = 

ay = —ao/10 bz = bo/2 
a= 0 b, = 

a= ay /280 bg = —by/8. 


So the two series solutions become 


eS 
09 = ao(+— 55 +5 —" (10.118) 
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and 


1 


fehl ae ae ee 1 
ya eae a (10.119) 


Now that these series solutions have been obtained, we must check to see if they 
converge. For this example, convergence can be shown using the ratio test. For y;(x), 
convergence occurs if 


; an xntl 
lim 


n—00 


<1. (10.120) 


An—2 x" 1 
From Equation 10.116, this condition is met because 
Be: 
n+ 3n 


jim | <1 x #0, (10.121) 
for all x except x = 0. We must check the x = 0 point specifically, because at this 
point there is a division by zero in Equation 10.120. A quick look back at Equation 
10.118 with x = O shows that there are no problems here either. So y,(x) converges 
for all values of x. 

For y2(x), convergence occurs if 


; by x" > 
jim b,x <1. (10.122) 
From Equation 10.117, we have 
pe 
lim 5 <1 x #0, (10.123) 
no |n* — 3n 


for all x # 0. Again, we need to return to the original series in Equation 10.119 to 
see what happens at x = 0. Because this series contains a term with a negative power 
of x, namely bo/ x, yo(x) will not converge at x = 0. 

Plots of y; (x) and y2(x) are shown in Figure 10.1. These functions have been given 
special names. With ag = 1/3, y,;(x) becomes the spherical Bessel function of order 
one and with bp = —1, y2(x) becomes the spherical Neumann function of order one. 


y y,(x) 


Figure 10.1 Spherical Besse] and Neumann Functions 
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The example given above is one of the most straightforward applications of the 
method of Frobenius. Complications can arise if the two indicial equations have a 
common value for s. This is treated in one of the exercises at the end of this chapter. 


10.5 THE METHOD OF QUADRATURE 


The method of quadrature sometimes works to find solutions to nonlinear, second- 
order differential equations of the form 


d*(x) 
dx at el 


where f() can be a nonlinear function of the dependent variable ¢. 


= f(#), (10.124) 


10.5.1 The General Method 


The general method of quadrature is fairly simple. Starting with a differential equation 
in the form given by Equation 10.124, both sides are multiplied by d@/dx: 


do(x)| [a7 h(x)] _ [dd@) 
| dx | | dx2 rs [se ic (10.125) 
The LHS of this equation is manipulated to give 
1d {d(x) d(x) 
ral dx | - ee | 106. (10.126) 


Integrating both sides over x and additional manipulation gives 


2 
do| _ 
<¢| 7 2 | do so) (10.127) 
/ 
4 7 p fora), (10.128) 
ee... Lene 
rer: f(r? fu (10.129) 
soy % (10.130) 


where the function g(¢) has been used to represent the result of the integration on 
the LHS of Equation 10.129. The solution to Equation 10.124 becomes 


o(x) = g '(x). (10.131) 


To apply this method it must be possible to perform both the integrals on the RHS 
of Equation 10.127 and the LHS of Equation 10.129, and also to invert Equation 
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10.130. In this derivation, the integration constants have been ignored. As will be 
shown in the following example, these constants must be accounted for and make the 
application of this method more difficult than the steps above imply. 


10.5.2 A Detailed Example: The Child—Langmuir Problem 


The solution of the Child—Langmuir problem is a classic example of the use of the 
method of quadrature. It also demonstrates the importance of the boundary conditions 
for these types of solutions. 

The Child—-Langmuir problem describes electron flow in the anode-cathode gap 
of an electron gun. The geometry is shown in Figure 10.2. A heater boils electrons 
off the cathode, into the anode-cathode gap. In the gap, the electrons are accelerated 
by an electric field imposed by the applied voltage V,. In an actual electron gun the 
anode is a mesh or has a hole to allow most of the accelerated electrons to continue 
past the anode and, in most cases, to the screen of a cathode ray tube. In this problem, 
however, we will concentrate only on the dynamics inside the anode-cathode gap. 
The cathode is located at x = 0, the anode at x = d, and the problem will be treated 
one dimensionally. This is a valid assumption as long as the dimensions of apparatus, 
perpendicular to x, are large compared to the gap distance d. 

If very few electrons are in the gap at any time, their interactions with each other 
will be negligible. In this case, the motion of any one electron is easy to describe. It 
sees a uniform electric field E in the gap that arises from the linear potential (x), 
shown in Figure 10.3: 


E=-Vo 
a _@o Vix é 
Ox d ” 
V, 
= (=) é,. (10.132) 
Cathode Anode 
Heater E Or 


Jif 
I af Nt 
Figure 10.2 The Anode Cathode Gap of the Child—~Langmuir Problem 
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d 
Figure 10.3 Potential Seen by an Isolated Electron 


In the gap, the electron feels a constant force F given by 


ex, (10.133) 


where —gq, is the electron charge. The equation of motion for the electron can be 
written as 
d’x(t) — qo Vo 
= £000 10.134 
Mo dre d ( ) 
where m, is the electron mass. The solution to this differential equation is easily 
obtained by integrating twice. If it is assumed that the electron leaves x = 0 with 
zero velocity, that is v,(0) = 0, its position as a function of time is given by 


_ { GVo 2 
x(t) = (2%) t’. (10.135) 


Using conservation of energy, we can write an equation that relates the velocity of 
the electron, as a function of position, to the potential: 


mate? = gee 
2 0 


= qo (x). (10.136) 


To arrive at Equation 10.136 we assumed that @(0) = 0. This equation can now be 


solved for the velocity: 
2 
v(x) = {22 (10.137) 


In the Child—Langmuir problem, the situation is more complicated because there 
is not just a single electron in the anode-cathode gap. Instead, many electrons are 
emitted at a high, continuous rate, so the gap can be filled with billions and billions 
of them. The presence of the electron charges modifies the electric potential in the 
gap so it no longer looks as shown in Figure 10.3. The field must now be treated 
self-consistently using Maxwell’s equations coupled with the equation of motion for 
the electrons. 
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We will look for a steady-state solution, with no time dependence. In that case, 
Maxwell’s equations simplify to 


VxE=0 (10.138) 
VxH=J (10.139) 
V -«E =p. (10.140) 
V - oH = 0 (10.141) 


where E and H are the electric and magnetic field vectors, J is the current density, p, is 
the charge density, and €, and x, are the dielectric constant and magnetic permiability 
of free space. Because there is no time dependence, the electric field has no curl and 
can still be expressed as the gradient of the scalar potential: 


E=-V¢. (10.142) 


Also, the equations involving the electric and magnetic fields uncouple, and Equations 
10.140 and 10.142 can be combined, with the result 


d>p(x) _ pe(x) 
dx? e.* 


(10.143) 


Given the boundary conditions of the problem, we will assume that ¢(0) = O and 
o(d) = Vo. 

There are two unknowns in Equation 10.143: the potential @(x) and the charge 
density p,(x), both functions of x. Consequently, to solve the problem another in- 
dependent equation relating p, and ¢ is necessary. This equation comes from the 
mechanics of the electron motion. To obtain this equation, we must introduce the 
current density J = J,(x) é,, which is just the charge density times the local velocity: 


F(X) = pe(x) v(x). (10.144) 


Using the expression derived in Equation 10.137, the velocity can be removed 
from Equation 10.144, and the result can be solved for p,(x): 


pelx) = Ie(x), aces (10.145) 


Although we now have an equation relating the potential and the charge density, it 
involves another unknown J,(x), so some other information is still necessary. For 
that, look at the continuity equation, which we derived in Chapter 2: 


V-J+ oe =0. (10.146) 


Since we have assumed. a steady-state solution, dp,/at = 0, and the current density 
must be a constant J = —J, @,. We have added a minus sign to make the constant J, 
positive. The actual value of J, will be determined later, after we apply the boundary 
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conditions. Equation 10.145 becomes 


mM, 
Rr ee Mi ON 10.147 
pc(x) 2aob(x) ( ) 


Inserting Equation 10.147 into 10.143 gives: 
d>h(x) _ Jo [Te 
dx? € V 2Goh(x) 


This is a nonlinear differential equation in the form of Equation 10.124, which can 
now be solved using the method of quadrature. First, equation 10.148 is multiplied 


(10.148) 


by d/dx to obtain 
1d [db@]? _ Jo [me ,-1n4¢ 
2dx | dx eV 20, ide (10.149) 
Integrating both sides from the cathode to some arbitrary point x gives 
db]> [db]? _ 4J, [mo 1/2 6 
E i E I. 7 Ey 2Go [oe $(0) | : (10.150) 


Because (0) is zero, we can simplify this to 


aa Fal BI Gig fi a 
abi eam) (ae = 2, /—* bx)”. (10.151) 
| dx x dx x=0 £0 24o : 

The other constant of integration, (dp/dx)"| 5» still needs to be evaluated. In 
order to do this, we must leave mathematics and return to the physics of the problem. 
According to Equation 10.142, this constant is just the square of the electric field at 
the cathode 


(db/dx)| 


If there were no electrons in the gap, the potential would behave as shown in Figure 
10.3, and (d/dx)*|__, = V2/d?. With electrons in the gap, the electric field, and 
consequently the potential, are modified. With a single electron in the gap, the electric 
field of the electron adds to the vacuum electric field as shown in Figure 10.4. To 
the left of the electron, the total electric field is decreased slightly and to the right 
the field is slightly increased. As we add more electrons to the gap, regardless of 
how they are distributed, the electric field is always decreased at the cathode and 
increased at the anode, compared to the vacuum field. The modified potential has the 
general form shown in Figure 10.5. Of course, because electrons are point charges, 
the electric field changes discontinuously from one side of an electron to the other. 
With many electrons in the gap, however, the charge can be modeled by a continuous 
charge density, and the electric field and potential vary continuously. 

As electrons are thermally emitted from the cathode, they are accelerated into the 
gap. If the electrons are emitted at a slow rate, there will be few electrons in the gap 
at any one time, and the electric field at the cathode will vary only slightly from the 


o = E;(0). (10.152) 
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vacuum 


IE| decreased IE| increased 


CEN 
a a 


E 


electron 


Cathode Anode 
Figure 10.4 Modification of the Vacuum Electric Field by a Single Electron 


vacuum case. As the temperature of the cathode is increased, electrons are emitted 
at a higher rate, and there will be more electrons in the gap. Now the electric field 
and the slope of the potential at the cathode become smaller. Therefore, there are 
many solutions for this problem, depending on the temperature of the cathode. The 
solution we will obtain is for the highest rate of electron injection possible. This will 
occur just as the electric field at the cathode goes to zero, because if the field were to 
reverse sign, electrons would be forced back into the cathode. For this solution, the 
second boundary condition is (d@/dx)|,=9 = 0. 
We can now proceed with the solution. Equation 10.151 becomes 


dbx) _ (4Jo [itg\'? 174 
eC - (2) $'/4(x) (10.153) 


This equation can be set up as an integral, again from the cathode where @ = 0, to 


the point x in the gap: 
(x) ! x 4 1/2 
soa = | Png eae aa le (10.154) 
op pA dy € V 240 


Figure 10.5 Potential Depressed by Electrons in the Gap 
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Performing the integration in Equation 10.154 gives the potential as a function of x: 


oJ, [m Me 
3/4 ry (ee eae 
p(x) (= ‘ | me) x. (10.155) 


The RHS of this equation still has the unknown quantity J,, but we have yet to impose 
the boundary condition @(d) = V,. This condition gives 


1/2 
vit = (32 [) d. (10.156) 
4e, V 24, 


4 0 
Jy = ay V3/2 we Joy. (10.157) 


The quantity J,; is referred to as the Child—Langmuir current density. It depends both 
upon the gap distance and the applied voltage. 

In this derivation, we assumed a particular cathode temperature, so that the electric 
field at the cathode was zero. If the cathode temperature is below this value, fewer 
electrons are injected into the gap, and the current density will be less than the Child- 
Langmuir amount given by Equation 10.157. The current density is then limited by 
the cathode temperature, and the gap is in what is called temperature limited flow. 
In this case, the current density increases exponentially with temperature until the 
Child-Langmuir limit is reached, as shown in Figure 10.6. This critical temperature 
is called the Child-Langmuir temperature T,;. Above T,;, electrons are emitted from 
the cathode at a higher rate than can be supported by the Child-Langmuir solution. If 
there are too many electrons in the gap, the electric field near the cathode is reversed, 
as shown in Figure 10.7. In this case, the field tends to decelerate the electrons, which 
are emitted from the cathode with a thermal distribution of velocities. The slow ones 
are returned to the cathode. Those that have enough velocity to make it to the point 
where the slope of the potential goes to zero, and the electric field reverses sign, 


Solving this for J, gives 


Current density 


Cathode temperature 


cl 


Figure 10.6 Temperature Limited Flow 
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Figure 10.7 Potential Distribution With the Cathode Temperature Above T,; 


are accelerated to the anode. The point where the slope of the potential goes to zero 
.s called a virtual cathode. To the right of the virtual cathode the Child-Langmuir 
solution discussed above holds. In this case, however, the voltage used to determine 
J. from Equation 10.157 will be slightly more than the applied voltage V,. It is 
increased by the magnitude of (x) at the virtual cathode. Also, the anode-cathode 
spacing used in Equation 10.157 will be less than the actual distance between the 
seal cathode and anode. It is reduced by an amount equal to the distance from the 
-eal cathode to the virtual cathode. Consequently as the cathode temperature is raised 
above T,,, the current density rises slightly above the J,; calculated using the values 
V, and d. This is a small effect, however, and the graph of the current density as 
a function of the cathode temperature becomes, qualitatively, as shown in Figure 
10.8. Assuming the voltage and gap spacing remain constant as the temperature of 
the cathode is raised from zero, the current density initially increases exponentially. 
This is the temperature limited flow region. When the temperature reaches T,), the 


Current density 


Cathode Temperature 


-____Temperature limited _____.. Space charge limited—___. 


Figure 10.8 Temperature and Space Charge Limited Flow 
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solution of Equation 10.157 is valid. As the temperature is raised above 7, there is 
only a slight increase of the current density with temperature. This is called the space 
charge limited flow region. 


10.6 FOURIER AND LAPLACE TRANSFORM SOLUTIONS 


Linear differential equations with constant coefficients can many times be solved 
using Fourier or Laplace transform techniques. If the solutions are finite and asymp- 
totically go to zero, they can be obtained using Fourier transforms. If they grow 
asymptotically, but the growth is exponentially limited, they can be handled with 
Laplace transforms. Solutions that grow faster than any exponential cannot be solved 
with these techniques. 


10.6.1 Fourier Transform Solutions 


We will consider the solutions to differential equations with space and time as the 
independent variables. If the solutions are finite and integrable, Fourier transforms 
with respect to the independent variables can be taken. Consider a function f(x, t) 
that depends on both a spatial dimension x, and the time t. The spatial-time transform 
of this function F(k, w) can be obtained by a two-step process. First consider the time 
transform pair: 


F(x, @) = ro [ : dt f(x, te (10.158) 
f(x,t) = oe dw F(x, we. (10.159) 


V 2a J—x 


Next set up the spatial transform as 


1 
Fk.) = —= if dx f(x, He (10.160) 
J24 J-« 
f (x,t) = i | dk F(k, tHe, (10.161) 
T =o 


where we have purposely swapped the signs of the exponent from the standard Fourier 
transform for the reason described below. These transformations can be combined to 
form a pair of equations relating f(x, t) and F(k, w): 


ane os - 
F(k, @) = 5— / dx / dt f(x, te or) (10.162) 


f(x,t) = ~ / dk i dw F(k, we"), (10.163) 
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Yow you can see the purpose of the sign switch for the spatial transform. The various 
omponents which make up f(x,1) with positive w and positive k can be viewed 
S$ plane waves traveling in the positive x direction, with a positive phase velocity 
if w/k. 

Many times, in physics and engineering problems, the dependent variable is a 
‘ector quantity and there are three spatial dimensions. This complicates Equations 
0.162 and 10.163 and motivates a shorthand notation for the transform pair. Consider 
he vector electric field E(x, y,z,f) = E(F,1), which is a function of three spatial 
limensions and time. The transform pair associated with this vector field quantity 
‘an be written as 


E(k, w) = a | aF i: dt E(, the or) (10.164) 
r —o 

EF, 1) = aay i d*k / dw E(k, w)e—*?, (10.165) 
k —@ 


n these expressions a propagation vector k has been introduced. In Cartesian geom- 
+try, it would be written as 


Kk = kb, + kyby + kb. (10.166) 


Che quantities d3¥ and d3k represent differential volume elements in real and k-space. 
Again, in Cartesian geometry, 


°F = dx dydz (10.167) 
d°k = dk, dky dk,. (10.168) 


Notice, the integrals involving these differentials are actually three integrals spanning 
he entire three dimensional range of F or k. 


Example 10.5 Maxwell’s equations are classic examples of differential equations 
nvolving space and time as the independent variables. They are used to describe the 
propagation of electromagnetic waves. The four equations can be written as 


v x HG) = < Bi, t) + Jr, 0) 
V x Ed) = -< MoH(F, t) (10.169) 
V - pH, 1) = 0 
V - «E(t, 1) = p-(F, 2). 
These equations form a set of coupled, partial differential equations with constant 


coefficients. If the current density J and charge density p, are specified, they can be 
looked at as driving terms, which make the equations nonhomogeneous. Alternatively, 
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if the current density and charge density are proportional to the fields, as is the case 
in linear dielectric media, the equations can be put into homogeneous form. 

These equations can be transformed using the Fourier transform pair given in 
Equations 10.164 and 10.165. Using the fact that on transformation, d/dt — iw and 
d/dx —+ —ik,, Equations 10.169 become a set of coupled algebraic equations: 


— ik X H(k, w) = iwe, E(k, w) + J(k, «) 
—ik X E(k, w) = —iwp,H(k, w) (10.170) 
—ik - poH(k, w) = 0 
—ik - «E(k, w) = R.(k, ). 
If the transforms J and R., are known, these equations can be solved algebraically for 
the transforms of the electric and magnetic fields. The real field quantities are then 
obtained by inverting these quantities. So you see, the task of solving the coupled 


partial differential equations has been simplified to simple algebra, as long and the 
Fourier transforms and inversions can be taken. 


Example 10.6 Another problem whose solution is facilitated by the Fourier trans- 
form approach is the damped, driven, harmonic oscillator. The geometry of the prob- 
lem is shown in Figure 10.9. The mass M is driven by an applied force d(t). It also 
experiences the spring force —K x(t) and a frictional force — a (dx/dt), proportional 
to its velocity. The differential equation describing the motion is 


2 
d*x(t) + Pink) + Kx(t) = d(t). (10.171) 


M dt? dt 


If d(¢) possesses a Fourier transform D(), and the solution x(t) transforms into X(@), 
Equation 10.171 can be transformed to 


—D(w)/M 


X(w) = (10.172) 


w? — iwa/M — K/M’ 


Figure 10.9 The Damped, Driven, Mass-Spring Oscillator 
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The denominator of the RHS of Equation 10.172 can be factored, and the solution is 
given by the Fourier inversion 


a! ost : —D(w)/M five 
x(t) = - [2 Can rm 7d 5 (10.173) 
where 
ia | K a2 


+ = oat -—. 10.17 
2- "IM VM 4p Conn) 

The response x(t) can be obtained once D(w) is determined. Let’s consider the 
particular case where d(t) is an ideal impulse, d(t) = [,6(t). Its transform is simply 


D(w) = ae (10.175) 


Jan’ 


and the response becomes 


me o/M iwt 
x(t) = on |, eno tere DG oe. i ; (10.176) 


where the integral has been converted to one in the complex w-plane on the Fourier 
contour F, shown in Figure 10.10. Now we can evaluate the integral using closure 
and contour integration. For ¢ > 0, the contour must be closed in the upper half plane, 
where it encloses the two poles of the integrand. For t < 0, it must be closed in the 
lower half plane and encloses no poles. A little algebra shows x(t) is given by 


0 t<0O 
x(t) = Io -at/QM) «: K_ @& . (10.177) 
——————— € sin ——-——t t>0 
VKM — a?/4 M 4M? 
_ ima 
| . @-plane 
' 
e iF 2M @ 
F real 
_ |K- o@ K_ a 
YM 4M ™M 4M 


Figure 10.10 Fourier Inversion for Damped Harmonic Oscillator 
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Figure 10.11 Damped Harmonic Oscillator Impuise Response 


This response of Equation 10.177 is plotted in Figure 10.11. The instant after the 
application of the impulse force at ¢ = 0, the mass has acquired a finite velocity, 
but has not changed its position. This velocity initiates the oscillation that is damped 
to zero by the frictional force. Notice there is no response before t = 0, as you 
would expect, because obviously the system cannot respond until after the impulse 
has occurred. This kind of behavior is often referred to as being causal. Causality, 
a characteristic of solutions involving time, requires that there can be no response 
before the application of a drive. 

Equation 10.171 is a second-order differential equation and needs two boundary 
conditions to make its solution unique. Notice that Equation 10.177 is a unique 
solution with no arbitrary constants. The boundary conditions must have somehow 
been included in the Fourier transformation and inversion process that arrived at this 
solution. 

What are the appropriate boundary conditions for this example? For a problem of 
this type, the boundary conditions are typically the initial values of x and dx/dt at 
t = 0. The initial conditions in this case, however, are complicated by the 6-function 
located exactly at t = 0. There are two ways we can look at these initial conditions. 
First, because causality implies that x = 0 for < 0, we can say just before the 
application of the impulse, at t = O_, these values are x = 0 and dx/dt = 0. Then 
the impulse occurs at t = 0 and instantaneously changes the velocity. The other view 
is to enforce the initial conditions just after the impulse is applied, at = 04, and 
remove the impulse drive from the differential equation. In this case, these t = 04 
“effective” initial conditions are x = 0 and dx/dt = I,/M. 

The solution in Equation 10.177, obtained from the Fourier analysis, satisfies both 
these conditions at f = O_ and t = 0. Because there are no poles of X(w) in the 
lower half w-plane, the Fourier inversion in Equation 10.176 will always be zero for 
t < 0, and thus satisfies the t = O_ initial conditions. Looking at Equation 10.177 
and its derivative for t > 0 shows that the t = 0, conditions are also satisfied. The 
instantaneous change in velocity, built into these “effective” boundary conditions, is 
a result of the D(w) term in Equation 10.175. 

The Fourier approach worked in this example, because the driving force could 
be Fourier transformed, and because the damped response could also be Fourier 
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ransformed. In many problems, it is not as obvious that the Fourier approach will 
work. For an unstable system, even if the driving term can be Fourier transformed, the 
‘esponse can diverge in time and, therefore, will not have a valid Fourier transform. 
Consequently, it is always safer to use the Laplace transform approach described 
delow, if the possibility of instability is present. 


10.6.2 Laplace Transform Solutions 


The Laplace transform can also be used for solving differential equations, and is 
2ssentially the same as the Fourier approach described above. It has the advantage 
of being able to treat unstable systems which grow exponentially. Unfortunately, it 
has an added complication because the initial conditions of the problem are present 
in the transforms themselves and need to be treated carefully. 

The standard Laplace transform pair is 


F(s) = i dt fie (10.178) 
0 


f(t) = = | ds F(sye". (10.179) 
L£ 


For the standard Laplace transform, it is assumed that f(t) is zero for t < 0. The 
inversion in Equation 10.179 is performed along a Laplace contour, to the right of all 
the poles of F(s). This position of the Laplace contour assures that the f(t) generated 
by the inversion is zero for t < 0. 

As with the Fourier approach, the first step is to Laplace transform the differential 
equation. Let the Laplace transform operation be represented by the L£ operator: 


L{x(t)} = X(s). (10.180) 


Then from the properties of the Laplace transform, 


. { oe} = sX(s) — x(0) (10.181) 
d*x(t) dx(t) 
c{ i \ = #° X(s) — sx(0) — = - (10.182) 


The initial value terms on the RHS’s of these equations complicate things. If, as is 
often the case, we know the solution and all its derivatives are zero at t = 0, we can 
drop these terms. Otherwise, they have to be carefully carried through the analysis, 
as discussed in the example that follows. 
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Example 10.7. As an example of the Laplace transform approach, consider an 
undamped harmonic oscillator, a simplified version of the example discussed in the 
previous section. The system can be described by the configuration shown in Figure 
10.9, with no frictional force, i.e., a = 0. The differential equation for this system 
becomes 


d*x(t) 


M 
dt? 


+ K x(t) = d(t). (10.183) 
Applying the Laplace transformation to both sides gives 
a = 
s’X(s) — sx(0) — ; + KX(s) = D(s). (10.184) 
t= 
As before, let the driving force be equal to an impulse of magnitude /, applied at 
t= 0: 
d(t) = 1,8(t). (10.185) 


The solution should be causal, so x(¢) = 0 for all t < 0. What values should be used 
for x and dx/dt at t = 0 in Equation 10.184? As with the Fourier transform example 
of the previous section, because of the 5-function drive at t = 0, there are two ways 
to think about these initial conditions. 

The first approach defines the initial conditions at t = O_ and includes the 
Laplace transform of the 6-function drive in Equation 10.184. Att = O_, x = Oand 
dx/dt = 0 so that Equation 10.184 becomes 


Ms’X(s) + KX(s) = I. (10.186) 


Now we can solve for the transformed solution: 


I,/M 
X(s) = . 
xo s+ K/M (0187) 
The final step is to invert this expression, 
I,/M : 
N= e*, 10.188 
i sai |48 < 24+ K/M- occa 


which can be evaluated using closure. The Laplace contour £ is placed to the right 
of the two poles located at s = +i,/K/M, as shown in Figure 10.12. For t < 0, 
the contour must be closed to the right, and no poles are enclosed. When t > 0, 
however, both poles are enclosed. The residue theorem and a little algebra give the 
final solution: 


t<0 


0 
wt | a JM) 130° (10.189) 


FOURIER AND LAPLACE SOLUTIONS 373 


imag 
s-plane 
JKTM 
“L 
aa real 
- /KTM e 


Figure 10.12 Inversion Contour for the Harmonic Oscillator 


The second approach defines the initial conditions at + = 0+. In this case, the 
Laplace transform of the 5-function drive is not included in Equation 10.184. The 
initial conditions must now be evaluated just after the impulse, so that x = O and 
dx/dt = 1,/M att = 0+. In this case, Equation 10.184 becomes 


Ms°X(s) ~ Ip + KX(s) = 0. (10.190) 


This leads to an equation for X(s) that is identical to Equation 10.187, and conse- 
quently the same solution for x(t) given by Equation 10.189. Notice that if thet = 04 
initial conditions as well as the Laplace transform of the 5-function drive were in- 
cluded in Equation 10.184, twice the correct answer for x(t) would be generated. 

In this example, the drive was a 6-function. Of course, this same approach works 
for other driving functions as well, as long as it is possible to obtain their Laplace 
transforms. Notice that this particular solution requires the use of the Laplace trans- 
form, as opposed to a Fourier transform. Even though the driving term can be Fourier 
transformed, the solution cannot. If a Fourier transform of this solution were at- 
tempted, it would have poles lying on the real w-axis, and we could not invert it. 


Example 10.8 This example explores the behavior of the unstable electrical circuit 
shown in Figure 10.13. This is a simple RLC-circuit except for the negative resistance, 
—R,. This oddity is a simplified model for an active element that can actually add 
energy to the circuit. The circuit is driven by a sinusoidal voltage source 


_ f0 t<0O 
UAE) ree 7° (10021) 


For t < 0 the current and all voltages in the circuit are zero. 
To obtain the differential equation that describes this circuit, start by summing the 
voltage drops around the loop, 
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Figure 10.13 Unstable Circuit 


v;(t) = iy + v,(t) + o,(t) 
= = | dti(t) — Rpi(t) + Le? 


192 
a’ (10.192) 


s) 


where i(f) is the current in the loop, as shown in Figure 10.13. Equation 10.192 is a 
mixed integral-differential equation. We can convert it to a pure differential equation 
by taking the derivative of both sides: 

d*i(t) di(t) _ du,(t) 


—R,—— + 
° dt? Ko dt aie dt 


(10.193) 


This is a second-order, nonhomogeneous differential equation with i(t) as the depen- 
dent variable. The driving term of this equation is dv,(t)/dt. 

Neither v,(t) nor its derivative have a valid Fourier transform, so we cannot Fourier 
transform Equation 10.193. We can, however, still take the Laplace transform of both 
sides of Equation 10.193 to get 


s°Lol(s) — sRol(s) + = = sV,(s), (10.194) 


where I(s) and V,(s) are the Laplace transforms of i(t) and v,(t), respectively. The 
initial conditions for i and di/dt at t = 0 are zero, because causality requires the 
response does not occur before the application of the drive. 

The Laplace transform of v;(t) is 


Vo Wo 


© + ia) — fas)’ ote) 


V,(s) = 
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so the expression for /(s) becomes 


Vo Wo S 
I(s) = sieae : 0.1 
1) = GF aye — ta, )\Los? — Ros + 17C,) ee 
The quadratic term in the denominator can be factored as 
Los? = Ros + 1/C, = Loe = 34 Ma 8), (10.197) 


where 


R [4 R 
ences ay es 10.198 
sr, VLC, 422 ( ) 


Notice that with a negative resistance, s, and s_ both have a positive real part. The 
Laplace inversion for i(t) is 


V, @ se" 


Jalna Full 
2m [es (s + iw,)(s — iw)Lo(s — 54)(s — s_) 


(10.199) 


The Laplace contour and the poles of the integrand are shown in Figure 10.14. 

To solve Equation 10.199 for arbitrary w,, R,, Lo, and C, is a messy task, which 
is simplified by choosing specific values for these quantities. We will let w, = 1, 
L, = 1, and choose the other values so that s,;, = 1 + 2i. Equation 10.199 then 
becomes 


| V, see 
Se [as (s+ is —ie— 1+ ie 1 — 21) eee) 
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Figure 10.14 Laplace Inversion for the Unstable Circuit 
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Figure 10.15 Response of the Unstable Circuit 


Using closure and residue techniques, the integration of Equation 10.200 can be 
performed to give 


{' t<0O 

ith=<¢ V, vo. Vo 4 3Vo 1. . (10.201) 
eo oe -— + > 

5 cost 10 sin t 5 e’ cos(2r) 0 e’ sin(2r) t>0 


For ¢ > 0 the current response can be written as 


i(t) = Vo cos(t + 0.157) — vee! cos(2t + 0.277). (10.202) 


2/5 


The part of this response with a time dependence of cos(f + 0.1577) is a result of 
the driving voltage at w, = 1. It persists at a constant amplitude for all ¢ > 0. The 
part of the response with a time dependence of e’ cos(2t + 0.277) is referred to as the 
characteristic response of the circuit, because it is determined entirely by R,, C,, and 
L,. It grows exponentially in time because the negative resistance causes the circuit 
to be unstable. Because of this instability, the long-term behavior of the circuit is 
dominated by this characteristic exponential response, while the part oscillating at 
the drive frequency w, becomes negligible. 

A sketch of Equation 10.202 is shown in Figure 10.15. In this figure the exponential 
growth rate has been decreased by an order of magnitude, i.e., we have used e°!! 
instead of e', so that the effects of the two parts of the response can be seen more 
clearly. For short times, both components are clearly visible, but for long times the 
unstable part dominates. 


10.7 GREEN’S FUNCTION SOLUTIONS 


The Green’s function approach is a powerful technique for finding solutions to lin- 
ear, homogeneous, and nonhomogeneous differential equations. Two methods for 
obtaining Green’s function solutions are discussed below. The first uses the Laplace 
transform and is limited to linear differential equations with constant coefficients. 
The second method is more general and can be applied to any linear differential 
equation. 
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10.7.1 Derivation Using the Laplace Transform 


Consider an ordinary, second-order, nonhomogeneous, linear differential equation 
with constant coefficients 


d*r(t) dr(t) 


Ao +B 
dt? ° dt 


+ Cor(t) = d(a), (10.203) 


where r(t) is the response to a known driving function d(t). The independent variable 
t is assumed to be time, so the concept of causality applies to the response. In other 
words, the response cannot occur before the application of the driving term. Assume 
the driving function is zero for tf < 0, and that it has a standard Laplace transform 
D(s). The causal boundary conditions are r(0) = 0 and dr/ dt\ _o = 0. The Laplace 
transform of the differential equation becomes 


(Aos” + Bos + Co)R(S) = D(s), (10.204) 


where R(s) is the Laplace transform of the response. Equation 10.204 can be solved 
for R(s): 


D(s) 
R(s) = = P 10.205 
RO Aas + B+ C money 
which, in turn, can be inverted to give the response 
1 Doje® 
t= d — : 10.206 
oe za f a oer eer a ( ) 


So far, this is nothing more than a recap of the Laplace transform approach to solving 
differential equations, which we presented in the previous section. At this point, 
however, we take a completely different tack. 

Returning to the Laplace transform of the response, it can be looked at as the 
product of two functions 


R(s) = D(s)G(s), (10.207) 


where G(s) = 1/(A,s? + Bys + C,). Consequently, from the product property of 
Laplace transforms, the time domain response is the convolution of two functions, 


r(t) = d(t) @ ge) 


=| dtd(t)g(t — 7), (10.208) 


where g(t) is the inverse Laplace transform of G(s) 


1 e# 
= : 10.209 
80) Qa [as Aos? + Bs + Cg ( ) 
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The function g(t) needs some interpretation before jumping into the convolution 
integral of Equation 10.208. The function G(s) satisfies the algebraic equation 


(Aos* + Bys + C) G(s) = 1. (10.210) 


Therefore, the same reasoning that led to Equation 10.204 implies g(t) is a solution 
to the differential equation 


2 
a8) , p WO) © arty = 5(0), (10.211) 


Ao dt2 dt 


with the same boundary conditions for g(t) as for r(t), namely g = 0 and dg/dt = 0 
at z = 0. The Laplace contour in Equation 10.209 is to the right of all of the poles 
of the integrand, so that g(#) is zero for all t < 0. Thus, the function g(t) is the 
response to a drive d(t) = 8(t), a unit impulse applied at t = 0. For this reason, g(t) 
is sometimes called the impulse response function. 

The function used in the convolution integral, Equation 10.208, is not g(t) but 
rather g(t — 7). This is simply g(t) delayed by an amount 7. Because the A,, B,, C, 
coefficients of Equation 10.211 are not functions of time, g(t — 7) is the solution 
to this equation with 5(t) replaced by 5¢t — 7), and with the causal condition that 
g(t — T) is zero fort < 7. In other words, delaying the 5-function drive just delays the 
impulse response by the same amount, without changing the functional shape. This 
behavior is called translational invariance. The function g(t — T) is called the Green’s 
function. The translational invariance of this Green’s function is a consequence of 
the constant coefficients in the differential equation and the nature of the boundary 
conditions. As we will show, not all Green’s functions have this behavior. 

The convolution in Equation 10.208 is an operation involving the driving function 
and the Green’s function. It, in effect, sums up the contributions from all the parts of 
d(t) to give the total response r(t). Notice, once the Green’s function for a specific 
differential equation is known, the response to any drive function can be determined 
by evaluating the integral in Equation 10.208. This is powerful stuff! 

Notice this derivation required several things. First, the differential equation had to 
be linear, with constant coefficients. Second, the problem had to have causal boundary 
conditions. In the next section, we will explore a derivation which requires only that 
we have a linear differential equation and a particular set of homogeneous boundary 
conditions. Eventually, even this last condition will be relaxed when we will discuss 
Green’s function solutions for problems with nonhomogeneous boundary conditions. 


Example 10.9 As an example, again consider the undamped harmonic oscillator, 
shown in Figure 10.16, with a square pulse driving force: 


0 t<0O 
dit)=< F, O<rt<T,. (10.212) 
0 Ty <t. 


GREEN’S FUNCTION SOLUTIONS 379 


d(t) , X(t) 
F os -K x(t) 


t ay) Arn t 
T 
Li 


Figure 10.16 Undamped Harmonic Oscillator with Square Pulse Drive 


The nonhomogeneous differential equation describing the motion is 


d*x(t) 


M—an 


+ Ky x(t) = d(t). (10.213) 


As usual, we will assume that there is no response before the drive, and impose the 
boundary conditions x = 0 and dx/dt = Oat t = 0. 

The first step in solving a problem of this type is to determine the Green’s function. 
This is done by solving the equation 


Me 8b = 7) 


i + K, g(t — tT) = &(t — 7), (10.214) 


with the same type of boundary conditions: 


g(t —7T)=0 t<T 
dg(t — T) 


= <T. 10.215 
a 0 t<T (10 ) 


Applying a Laplace transform to both sides of Equation 10.214 gives 
(Ms’ + K,)G(s, 7) =e", (10.216) 


where G(s, 7) is the Laplace transform of g(t — 7). This equation can be solved for 
G(s, T) and then Laplace inverted to give the Green’s function 


(je 


ae) = mi fs ‘(+ i,/K,/M My(s — i\/K,/M) 


Evaluating the integral using a Laplace contour on the right of both poles in the 
denominator gives 


(10.217) 


t<T 


0 
se (1/./KoM) sin | VK./M ss | pone 0218) 


380 DIFFERENTIAL EQUATIONS 


d(t) 


Figure 10.17 The Driving Force and Green’s Function vs. t 


The next step is to use this Green’s function and the driving function in Equation 
10.208 to determine x(t): 


x(t) = a dt d(r)g(t — 7). (10.219) 


To evaluate this convolution integral, it is instructive to plot d(t) and g(t — 7) as 
functions of t, as shown in Figure 10.17. The integral of Equation 10.219 must be 
evaluated for three separate ranges of t: < 0,0 <¢ < T,, andt > T,. The first 
range is easy because, from Figure 10.17, you can see there is no overlap of d(7) and 
&(t — 7), so x(t) = O for t < O. In the next range, when 0 < t < T,, the functions 
overlap in the range 0 < 7 < ¢, as shown in Figure 10.18. The expression for x(t) 
becomes 


t 
x(t) = at = sin [VK./M == 2) O<t<,. (10.220) 
0 o 


KM 


This integral is straightforward, and gives the result 


x(t) = z [1 — cos (VK./M1) | 0<1<T,. (10.221) 


Figure 10.18 Overlap for0 <t< T, 
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g(t-t) 


Figure 10.19 Overlap for t > 7, 


The overlap in the final range, when t > T,, is shown in Figure 10.19. In this case, 
the integral for x(t) becomes 


T, 
x(t) = if dt Tei sin [VKo/M = 7| T, <t< ©, (10.222) 


which evaluates to 
x(t) = Z [sin (VK./Mr) sin (VK./MT.) 
+ cos (VK./Mr) cos (VK./MT. ) 
-cos(VKo/Mt)| 1 >To. (10.223) 


A plot of x(z) is shown in Figure 10.20. Notice that x(t) and dx(t)/dt are continuous 
everywhere, and d*x(t)/dt? is continuous everywhere, except at t = 0 and t = 7», 
where the driving force changes discontinuously. 


x(t) 


Figure 10.20 Response of Harmonic Oscillator to a Square Pulse of Force 
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10.7.2 A More General Approach 


There is another approach to Green’s functions which requires only that the differ- 
ential equation be linear, and that its solutions satisfy what are called homogeneous 
boundary conditions. A general, second-order, nonhomogeneous equation has the 
form 


2 
‘ oo + B(x) we ) + CQr(e) = d(x). (10.224) 
dx 


A(x) 


We are not going to assume the independent variable is causal, and to emphasize 
this we have used x as the independent variable instead of t. The coefficients are 
no longer constants, but can be a function of the independent variable. The three 
functions A(x), B(x), C(x), and the driving function d(x) are presumed to be known. 
In addition to this differential equation, two boundary conditions are necessary to 
make the solution r(x) unique. These boundary conditions usually involve r(x) or its 
first derivative dr (x)/dx evaluated at the same or different values of x. We will begin 
by looking at the consequences of the linear nature of the equation and develop a 
Green’s function solution for r(x). Then the question of the boundary conditions will 
be addressed. 

Because the differential equation we are dealing with is linear, a superposition 
principle applies to its solutions. That is to say, if a drive d\ (x) produces the response 
r;(x), and another drive d(x) produces the response r2(x), then the response to 
a drive function d(x) = Ajd;(x) + Azd2(x), where A; and A> are constants, is 
simply r(x) = Ar1(x) + Air2(x). This, of course, assumes the boundary conditions 
are satisfied for all three responses. Now assume that the drive is a 5-function 
d(x) = 8&(x), and that this drive produces a response r(x) = g(x). Next, shift the 
driving delta function to x = €, so that d(x) = 5(x — €). The response to this shifted 
8-function drive will depend upon é, but because the coefficients of the differential 
equation are not constant, it cannot be assumed that the response is just g(x) shifted 
by €. We will represent the response to the shifted 5-function drive by the Green’s 
function g(x|£), which is pronounced “g of x given &”. This function satisfies the 
differential equation 


def) 


+ B(x)" + C(xda(xlé) = 5(x — €). (10.225) 


dg(xlé) 
AQ)—Fa 


The superposition principle then says, if the drive is a discrete sum of weighted, 
shifted 5-functions, 1.e., 


d(x) = 5_ And(x — &), (10.226) 


the response is a sum of Green’s functions weighted with the same A,,: 


r(x) = So Ang(1&)- (10.227) 
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r(x) = g(xlg) 


d(x) = 8(x-) - 


d(x) = A,8,(x-&) +A,8,(x-&) 


(§ & 


Figure 10.21 The Superposition Principal 


This superposition action for a drive composed of the discrete sum of two 6-functions 
is shown in Figure 10.21. If d(x) goes to a continuous sum of weighted 6-functions, 


a(n) = [dg aoe &) (10.228) 
the response becomes a continuous sum of Green’s function with the same weighting: 
r(x) = fi d& A(E)g(x1&). (10.229) 


An arbitrary function d(x) can always be written as a continuous sum of 6- 
functions: 


d(x) = f. dé d(€)8(x — 6). (10.230) 


This means that, given any driving function d(x), the solution to Equation 10.224 can 
be written as 


r(x) = / dé d(é)g(alé). (10.231) 


The Green’s function g(x|é) is the solution to Equation 10.225, provided the boundary 
conditions work out. 
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Equation 10.225 cannot be solved for the Green’s function without specifying 
boundary conditions for g(x|). Therefore, to obtain the Green’s function and eval- 
uate the response in terms of the Green’s function integral of Equation 10.231, the 
boundary conditions for g(x|é) must be determined so that Equation 10.231 pro- 
duces an r(x) that still satisfies the original boundary conditions of the problem. Zero 
boundary conditions on r(x) are easy to handle. By zero boundary conditions we 
mean that r(x) or a derivative of r(x) is zero at some value of x, say x = Xp. If this is 
the case, these boundary conditions on r(x) will be established by a Green’s function 
that obeys the same boundary conditions, that is g(x]€) or its derivative is zero at the 
same x = Xo, for any €. These zero boundary conditions are specific examples of 
homogeneous boundary conditions, which are discussed below, first for the specific 
case of a second-order differential equations, and then in the general case. 

For a second-order linear differential equation, two boundary conditions must be 
specified. They generally involve the response and/or its first derivative, evaluated at 
the same or two different values of the independent variable. The most general forms 
for homogeneous boundary conditions for a second-order equation are 


a r(x)| _ +a “ =0 (10.232) 
By ro) + pe =0, (10.233) 
x=b 


where a, @2, 8; and PB, are constants. If these types of boundary conditions are 
applied to a g(x|é), it is easy to show that the r(x) generated by Equation 10.227, 
and consequently Equation 10.231, will satisfy the same homogeneous boundary 
conditions. The homogeneous boundary conditions of the Laplace Example 10.9 fit 
this form with a = b = 0, a, = B2 = 1, anda, = B, = 0. 

For a linear, nonhomogeneous differential equation of n” order, there must be n 
boundary conditions that may involve the response up to its n — 1 derivative. For an 
n'*_order equation, the general form of the homogeneous boundary conditions are n 
different equations: 


n i-1 

a a = 0. (10.234) 
i=! x=a 

The first equation involves a set of constants a; and the function and/or its derivatives 

are evaluated at x = a; the second, a set of constants 8; and terms evaluated at x = b; 

etc. 

The Green’s function argument is now complete. Given a linear, nonhomogeneous 
differential equation in the form of Equation 10.224 and a complete set of homo- 
geneous boundary conditions in the form of Equation 10.234, the response can be 
obtained from a Green’s function integral. The Green’s function g(x|&) is the solution 
to the original differential equation with a 5-function drive and satisfies the same set 
of homogeneous boundary conditions. 
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This derivation for the Green’s function solution is correct, but a little sloppy. 
In Appendix D there is a more formal, but far less intuitive proof for the existence 
of Green’s functions and for the properties of homogeneous boundary conditions. 
These proofs are for second-order differential equation, but are easily extended to 
linear equations of any order. 


Example 10.10 We will now investigate a classic problem which uses a Green’s 
function solution. In this example, space is the independent variable instead of time. 
Imagine a uniform, elastic string, stretched between two fixed points, as shown in 
Figure 10.22. The ends of the string, at x = 0 and x = L, are both held at y = 0.A 
distributed force per unit length F(x) is applied along the string. Our sign convention 
is that a positive force points in the negative y direction. Assume the string has a 
uniform tension 7,, and that the displacement caused by F(x) is so small that this 
tension does not change. 
The differential equation describing the displacement of the string is 


d?y(x) 
dx? 


To = F(x), (10.235) 
and the boundary conditions are homogeneous, with y(0) = y(L) = 0. Because 
this is a linear differential equation, it can be solved using a Green’s function. The 
displacement becomes 


L 
yea) = [dg F@gta16) (10.236) 
0 
where g(x|é) is a solution to 
2 
7,4 8218) © 5 — 8, (10.237) 
dx2 


with g(O|E) = g(L|é) = 0. Because the Green’s function obeys homogeneous bound- 
ary conditions, the solution it generates will obey the same homogeneous boundary 
conditions. 


F(x) 
Figure 10.22 The Stretched String Problem 
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F(x)=8(x-) 


Figure 10.23 The Green’s Function Displacement of the String 


The interpretation of Equation 10.237 is simple and is depicted in Figure 10.23. 
The driving force is a single 6-function located at x = &. The response to that 
driving term is the Green’s function g(x|¢), which is zero at x = 0 and x = L for all 
0 < & < L. Notice, in the small displacement approximation, the vertical components 
of the tension add to cancel the applied 5-function force per unit length. The fact that 
the horizontal components of the tension do not exactly cancel is ignored. 

The Green’s function for this problem can be obtained by solving Equation 10.237. 
Everywhere, except exactly at the point x = &, the equation for g(x|é) is 


2 
rt gOxlé) _ 0, (10.238) 
dx? 
which has the simple solution 
g(xlé) = Ax + B. (10.239) 


Because there are two different regions that have this solution, separated by the point 
x = €, we can write the general solution for g(x|&) as 


Aix + B, O<x<€& 


alg) = ie £B Exe ee (10.240) 


where the constants A; through B, must be determined. 
The boundary conditions at x = 0 and x = L give two equations involving these 
constants, 


A,(0) + B, =0 (10.241) 
and 

A2(L) + Bp = 0. (10.242) 
A third equation is obtained by requiring g(x|é) to be continuous at x = €: 


A,(€) + By = A2(&) + Bo. (10.243) 
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In other words, the string does not break due to the force. The fourth and final equation 
takes a bit more work, and involves the driving term 5(x — &), which has not yet been 
used. Start by integrating Equation 10.237 from x = € — € tox = € + € in the limit 
of e > 0: 


ét+e 2 E+E 
| dx 7,7 88) = | dx &(x — &). (10.244) 


= dx? =e 
The LHS of this equation goes to T, times the discontinuity in slope of g(x|é) at 
x = &, and the RHS is one because we are integrating over a 6-function. The result 
is 


dg(xlé) 
° dx 


_ dg(xlé) 


=1. 10.24 
ae (10.245) 


é+e é-e 


Taking the limit as e — 0, and referring back to Equation 10.240, gives 
T, (A2 — Ai) = 1. (10.246) 


Notice that this equation is essentially a statement that the vertical forces at x = 
must cancel. Equations 10.241-10.243, along with Equation 10.246, constitute a set 
of four independent equations that uniquely determine A, A2, B), and Bz. Equation 
10.240 for the Green’s function becomes 


g(xlé) = { —[x/(ToLML - €) = gle) O<x<é 


(E/T, DVL —- x) = gl) E <x <L* (10.247) 


Notice that this Green’s function does not have the simple form g(x — &) of a 
translationally invariant solution. A quick look at Figure 10.23 and the boundary 
conditions at the ends of the string show why this must be the case. 

This Green’s function can now be used in Equation 10.236 to obtain the displace- 
ment of the string for an arbitrary loading force density F(x). Some care, however, 
must be taken in setting up this integral. For this integration, x is held fixed, some- 
where between 0 and L, while the integration variable ranges from 0 to L. According 
to Equation 10.247, g(xlé) = go(xlé) when & < x and g(x|&) = g1(xl&) when € > x. 
The integration of Equation 10.236 must therefore be broken up into two parts. The 
Green’s function solution for an arbitrary F(x) becomes 


7 Xx -€ L 
y(x) i dg 7 (L ~ x)F(E) + i dé 


—x 
ToL 


(L — €)F(€). (10.248) 


10.7.3 Nonhomogeneous Boundary Conditions 


Up to now, the Green’s function analysis has been for problems with homogeneous 
boundary conditions. In this section, we will review, briefly, why homogeneous 
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Figure 10.24 Homogeneous Boundary Conditions 


boundary conditions are important. Then a method for dealing with nonhomogeneous 
boundary conditions will be introduced. 

In the case of the stretched string problem, the example of the previous section, 
the homogeneous boundary conditions specified that the response be zero at either 
end of the string. These boundary conditions fit the general homogeneous form 
of Equation 10.234. We required the Green’s function to obey the same boundary 
conditions. This works because when the Green’s function is constrained to be zero 
at x = 0 and x = L, as shown in Figure 10.23, an arbitrary sum of Green’s functions 
will automatically obey the same conditions. As an example of this, the sum of 
three Green’s functions is depicted in Figure 10.24. Therefore, forcing the Green’s 
function to obey the same boundary conditions as the original differential equation 
is the proper approach in this case. 

The difficulty with nonhomogeneous boundary conditions is easily demonstrated 
with the string problem by changing the second boundary condition so that y(L) = yp. 
A generic load and displacement which obeys this new condition is shown in Figure 
10.25. Problems immediately develop if these boundary conditions are applied to the 
Green’s function, and the response is generated with a summation of several of them. 
A sum of three of these Green’s functions is shown in in Figure 10.26. As with Figure 
10.24, here F(x) is composed of three weighted 6-functions. The response to each 
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Figure 10.25 Loaded String with Nonhomogeneous Boundary Conditions 


of these individual forces is shown on the left, each one proportional to a Green’s 
function which satisfies the nonhomogeneous boundary conditions. If these solutions 
are simply summed, as would be done by the Green’s function integral, the result 
would not satisfy the required boundary conditions. This is shown on the right of 
Figure 10.26 where the three terms have been summed to give y(L) = 3 yp. Clearly, if 
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Figure 10.26 The Nonhomogeneous Boundary Condition Problem 
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g(x|€) satisfies the nonhomogeneous boundary conditions, the correct answer is not 
generated by the Green’s function sum. 

Fortunately, there is a way around this problem. It is clear from the picture above 
that to keep the generic Green’s function integral 


b 
r= i dé d(ée(elE) (10.249) 


from diverging for x = a or x = b, the Green’s function must be forced to have 
homogeneous boundary conditions. But we can add to this function another term, 
which makes the solution obey the nonhomogeneous boundary conditions. 

To see how this works, let’s return to the string problem with the nonhomogeneous 
boundary conditions y(0) = O and y(Z) = y,. The solution y(x) still satisfies the 
differential equation 


2 
7,42 = Fe), (10.250) 
dx? 
Now break y(x) into two parts, 
yx) = yi(x) + yo), (10.251) 


where y;(*) satisfies the nonhomogeneous differential equation, Equation 10.250, 
but has homogeneous boundary conditions, i.e., y,(0) = 0 and y,;(L) = 0. Let y2(x) 
satisfy the homogeneous version of Equation 10.250 


d? yo(x) 
To dx2 = 0, 


(10.252) 


with the nonhomogeneous boundary conditions, y2(0) = 0 and y2(L) = yo. The 
sum of these two solutions, y,(x) + y2(2), will satisfy both the nonhomogeneous 
differential equation and the nonhomogeneous boundary conditions. The y(x) part 
of the solution can be found with the standard Green’s function approach, by requiring 
the Green’s function to satisfy homogeneous boundary conditions. The y2(x) part is 
obtained by solving a simple homogeneous differential equation and then applying 
the nonhomogeneous boundary conditions. 
In this case, y)(x) is the same solution found in Equation 10.248: 


oe bs 36. - Sie. 
n= [agrorte-n+ [ deP@TAL-H. 10.253) 
The y2(x) part of the solution, the solution to Equation 10.252, is simply 
yo(x) = Ax + B. (10.254) 


Applying the nonhomogeneous boundary conditions forces the constants to be A = 
yo/L and B = 0, so that the total solution is 
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i. x _ é L =< Yo 
y(x) = i dé FOp x)+ | dé FOr et é)+ he (10.255) 

In summary, to solve a linear, nonhomogeneous differential equation with non- 
homogeneous boundary conditions, break the problem into two parts. Use a Green’s 
function approach to solve the original, nonhomogeneous differential equation using 
a Green’s function which satisfies homogeneous boundary conditions. The desired 
solution is then the sum of this solution and the solution to a homogeneous version 
of the differential equation, which obeys the nonhomogeneous boundary conditions. 


10.7.4 Multiple Independent Variables 


To this point, all our Green’s function examples have had a single independent vari- 
able. How are things modified when we consider higher dimensions? In this section, 
through examples, we will explore two-dimensional solutions for two common prob- 
lems, diffusion and wave propagation. Both examples have one space dimension and 
the time dimension. 


Example 10.11 The Green’s function approach can also be used to solve linear 
partial differential equations. Consider the diffusion of heat along the infinite, one- 
dimensional conducting rod shown in Figure 10.27. The homogeneous diffusion 
equation 


pe PT(x,t) AT (x,t) _ 


0 10.256 
ox? ot ( ) 


describes the time and space dependence of the temperature T(x, t) along the rod. 
The constant D is called the diffusion coefficient. A common way to formulate a 
diffusion problem is to specify T(x,0) = T,(x), the initial temperature distribution 
at t = 0, as shown in Figure 10.27, and then solve for T(x, t) for t > 0. It will be 
assumed that T(x,t) -~ 0as x — + for all t. 


| T%,0) = To) 


Figure 10.27 One-Dimensional Conducting Rod with Initial Temperature Distribution 
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At first, this does not look like a problem that could be solved using a Green’s 
function method, since the differential equation is homogeneous and there is no 
driving term. It is, however, a linear equation, so the superposition principle does 
apply. That is to say, if T,(x) = 5(x — €) results in T(x,t) = g(alé,t), then 


T(x) = So And(x — &) (10.257) 


will result in 


T(x,t) = > Angal&, 1) (10.258) 


for all t > 0. Taking these summations to continuous integrals allows us to argue 
that, if 


foo) 


T(x) = i; dé T,(£)5(x — &), (10.259) 
then 
T(x,1) = / dé To(E)g(xlé, 1) (10.260) 


for all t > 0. Notice how we have set up the arguments of the Green’s function. The 
variables x and & have been used as standard Green’s function variables, while t is 
not involved in the Green’s function integral of Equation 10.260. This is indicated by 
the notation in the argument of the Green’s function. Only variables immediately to 
the right of a “|” will take part in Green’s function integrations. 

So the solution to the problem can be formulated using a Green’s function, with 
the initial temperature distribution acting as the weighting function in the Green’s 
function integral. The boundary conditions require T (x,t) — 0 as x — +o and are 
homogeneous. Consequently, the same boundary conditions are applied to g(xlé, 1). 
This Green’s function must satisfy 


p22 Balt) _ dgelé.t) _ 


10.261 
Ox? ot 0, An0 ) 
with the initial condition at t = 0 
g(xl€,0) = d(x — €) (10.262) 


and the boundary conditions described above. Notice in this problem, the initial 
condition is treated very differently from the boundary conditions. 

One method to solve Equation 10.261 is to use a Fourier transform in space. A 
standard Laplace transform cannot be used because the solution must be valid for 
all x. We define 


G(Klé,t) = = / ‘ dx g(xlé, He, (10.263) 
TT J—a 
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and the Fourier transform of Equation 10.261 becomes 


to] yt 
~ DR’ G(KE, t) = _ (10.264) 
This is an easy first-order equation, with the general solution 
GE, 1) = Ge", (10.265) 


where the complex constant G, = G(klé, 0). But g(xlé,0) = &(x — &), so 


G, = F {g(alé, 0)} 


1 % 
<= | dx (x — €)e* 
V2 0 
seahorse, (10.266) 
V 20 


The Fourier transform of the Green’s function becomes 


Gal, t) = 


1 : 

ee grr (10.267) 
21 
and the Green’s function, itself, is the Fourier inversion of this expression: 


eb DPR t pike (10.268) 


1 ie 1 
t= ——_ dk —— 
(al. #) J 2 - V20 


This inversion can be done most easily using a convolution trick. The Fourier 
ransform of this Green’s function can be looked at as the product of two functions 


G(klé,t) = FRE, OH(ALE, 1), (10.269) 
where 
1 ‘ 
F(kKlé,t) = ——=e 10.270 
F(Alé, t) an ( ) 
ind 
H(klé,t) =e P*, (10.271) 


The Green’s function is therefore 1/./27 times the convolution of the Fourier 
nversions of Equations 10.270 and 10.271. Convolution is usually a messy process, 
yut not in this case because the inversion of 10.270 is just a 5-function 


F lage = &(x — é), (10.272) 
vie 
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and 6-functions convolve easily. The second function, Equation 10.271, is a Gaussian 
and it inverts to another Gaussian: 


FMenPPY = ae SNe, (10.273) 
t 


The Green’s function becomes a convolution over x: 


a(alé, 2) 7 soem @ &(x-é) 


Il 


a 3 dx! ee &—#Y 14D") 5(x! — é) 

V4nD*t J—o 

= heeds e EP HAD (10.274) 
V4rD2t 


Equation 10.274 describes how an initial temperature distribution T(x,0) = 
5(x — &) evolves in time and space. A graph of this function is shown in Fig- 
ure 10.28. Notice that in this case g(x|£,t) has the form g(x — &,t), and thus the 
Green’s function is translationally invariant. As the initial 5-function shifts in space, 
the response shifts correspondingly. This happens because the differential equation 
has constant coefficients and the homogeneous boundary conditions are located at 
infinity. 


g(xl€,t) 


Figure 10.28 Green’s Function for the Diffusion Equation 
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The general response for an arbitrary initial temperature distribution 7,(x) is then 


T (x,t) = dé T,(&) e— 7 /40"9), (10.275) 


1 a 
V4nD21 J-« 


Notice the integration of Equation 10.275 is over only the x variable. The ¢ variable 
just hangs around as a simple time dependence of the Green’s function. In the next 
example, both time and space are treated as Green’s function variables, and the 
response will involve an integral over both of these quantities. 


Example 10.12 We now consider the same one-dimensional, conducting rod as we 
used in the previous example, but this time with an external driving term. Imagine a 
special heating element, running through the center of the rod, which can add heat in 
an arbitrary way that depends both on x and t. The heat source s(x, t) turns Equation 
10.256 into a nonhomogeneous differential equation: 


OT (x,t) 9 0°T (x,t) 
ee 


5 —— 5(x, t). (10.276) 


The signs have been adjusted in this equation so that a positive source term produces 
a positive change in temperature. The solution still must obey the homogeneous 
boundary conditions, T(x,t) > 0 as x — +o. In addition, we will also assume 
that the rod obeys causality with respect to the time variable. That is, there can be 
no response in the rod before the heat source is applied. The source term s(x, ft) is 
assumed to be zero for t < 0, so that T(x,t) = 0 fort < 0. 

Because Equation 10.276 is a linear equation with homogeneous boundary con- 
ditions, it must obey the superposition principle. Thus, if a drive of s(x,t) = 
8(x — €)6(t — 7) results in T(x, 1) = g(xlé, t]7), then a drive of 


s(x, t) = 3 So Anm8(x — &)5(t — tm) (10.277) 


n 


causes a temperature response of 


T (x,t) = $0 So AnmBalén, ttm). (10.278) 


This reasoning can be extended for continuous quantities by converting the summa- 
tions into integrals. So if the drive is now 


5(x,t) = ‘| az [ dt (&, T)8(x — E)S(t — 7), (10.279) 
the response is 


T(x,t) = ‘a dé ig dt s(&, t)g(x1€, t\7). (10.280) 
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The Green’s function for this problem is the response of the rod to a 6-function 
impulse at position x = & and time ¢ = 7. It obeys the partial differential equation: 


2 
Sa(elé tt) _ pp Matter) _ 50, _ e534 — 2). (10.281) 
ot ax? 

The homogeneous boundary conditions T(x, t) must satisfy are also applied to the 
Green’s function, so g(xJé, |r) — 0 as x — +0, The causality condition requires 
that g(alé, lr) = 0 for t < +. To solve Equation 10.281, we will take a Fourier 
transform in space, and a Laplace transform in time. Taking these transforms of the 

Green’s function in two steps gives 


Gp(HE,tIr) = z / «dr glé,thye™ (10.282) 
Gri (HE, sh0) = | dt Gp(HE, tne". (10.283) 


The Fourier-Laplace transform of Equation 10.281, the original differential equation, 
gives the algebraic relation 


sGp,(Ké, slt) + DG, (ME, |) = — 


Te (10.284) 
T 


This allows us to solve for the Fourier-Laplace transform of the Green’s function 


eke est 
Gr (KE, = ————__.. 10.285 
Gr (ké, sl7) Jame + DR) ( ) 


Now all that remains to obtain the Green’s function is the double inversion of 
Equation 10.285. In principle, the inversions could be done in either order. In this 
case, it is much easier to do the Laplace inversion first. 

The Laplace inversion can be evaluated from 


e iké est ef 


1 
Gr(Ké, tlt) = — | ds —==—--—__.. (10.286) 
e 2mi Ie \/2n(s + D2k?) 
There is a single pole at s = —D*k*. Because the variable k is always on a Fourier 


contour, it is pure real. The diffusion coefficient D is also pure real. Therefore, this 
pole is always on the negative, real axis in the complex s-plane, as shown in Figure 
10.29. The Laplace contour is always to the right of this pole. The integral in Equation 
10.286 can be performed using closure, closing to the right for f < 7 and to the left 
fort > 7, with the result 


0 t<T 
Ge (klé, tlt) = {ayia ee (10.287) 


Equation 10.287 can be written in more compact form using the Heaviside step 
function, 
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figure 10.29 Laplace Inversion for the Nonhomogencous Diffusion Equation Green’s Func- 
ion 


H(t—1) = o ; 5 Ma (10.288) 
is 
1 - i ,-DR(- 
G,(klé, tlt) = ——H(t — rhe e-PRO, (10.289) 
V2r 


The space-time Green’s function is now obtained by applying a Fourier inversion to 
3quation 10.289: 


g(xlé, tlt) = A(t - Den | dk el @-8)e-DR 7) (10.290) 


Jne way to solve this integral is by completing the square of the integrand’s exponent. 
{nstead, we will use the convolution trick again. Notice the function in Equation 
10.289 can be written as the product of two functions: 


ee 


[i = Tin 
7 


| je oe 9), (10.291) 


Consequently, the Green’s function can be written as 1 /./27r times the convolution 
over x of their inverse transforms: 


1 


1 
; = a A —_— ts) — ee 
a(xlé,tl2) al @- N80 ~ )® Gram 


eV AD)! (40.292) 
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Figure 10.30 Green’s Function for the Diffusion Equation with a Distributed Heat Source 


Because of the 5-function, this convolution easily evaluates to 


g(al€, tr) = HEAD 6 F APU, (10.293) 


V4aD(t — 7) 


A plot of this Green’s function is shown in Figure 10.30. 

There are some interesting things to notice about this Green’s function. First, the 
t|7 dependence has the form t — 7 and the x|€ dependence has the form x — €. These 
results make sense both mathematically and physically. Mathematically, this occurs 
because the problem has constant coefficients and homogeneous boundary conditions 
atx = +o, Physically, it is clear from symmetry, the problem must be translationally 
invariant in both time and space. Also notice there is a symmetry with respect to 
the x and é variables, with g(x|é, tlt) = g(élx, t|7), while there is no such symmetry 
for the ¢ and 7 variables, g(xlé, tlt) * +g(xlé, r]t). This symmetry property will be 
explored in one of the exercises at the end of this chapter. . 


Example 10.13 Asa final example, the Green’s function for a driven wave equation 
will be developed. The one-dimensional driven wave equation for propagation in a 
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uniform medium can be written as 


1 d?u(x,t) u(x,t) 
cz ot? ax? 


= s(x, f). (10.294) 


The constant c, is the velocity of propagation of the wave. This is a nonhomogeneous 
equation, with the response u(x, t) generated by the distributed source term s(x, t). 

This problem can be treated with a spatial Fourier transform and a temporal 
Laplace transform, exactly like the diffusion equation of the previous example. There 
is no problem with this approach. We will demonstrate a different approach, however, 
because it is one commonly used in physics calculations. To avoid the tedious Laplace 
transform, we will assume that the time behavior of both the driving term and the 
response are in the sinusoidal steady state, so all the time dependence occurs with a 
constant amplitude at a fixed frequency w,. In other words, we assume that 


s(x, t) = Real [S(x)e"] , (10.295) 
and the response has a similar time dependence 

u(x,t) = Real [U(x)e'®'] . (10.296) 
As you will see, this is very much like taking a Fourier transform in time. Unfortu- 
nately, you will also find out it presents a few problems! 


In this sinusoidal steady state, the differential equation becomes 


d?U(x) 
dx 


“i2 
SU = S(x). (10.297) 


Notice the time dependence completely cancels out, and we have a linear, nonhomo- 
geneous equation for U(x). Also, we now are dealing with only a single independent 


variable, so the partial derivative has been replaced by a regular derivative. The 
solution for U(x) can be formulated in terms of a Green’s function, 


U(x) = / dé S(E)e(2lE), (10.298) 


where g(x|é) satisfies the differential equation 


2 da 
(-2 7 =) gl) = a(x - 8. (10.299) 


We can take the Fourier transform of both sides of this equation to give 


1 


V20 


w2 . 
(-3 + ‘) G(klé) = eRe (10.300) 
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Figure 10.31 Fourier Inversion for the Driven Wave Equation 


so that 
en iké 
Gilg) = Vile a2) (10.301) 
The Green’s function is then obtained by the Fourier inversion 
Oe elk) 
Ko = 7, | ara pt sel en 


And now you should see the problem. The Fourier contour must remain on the 
real k-axis, but if it does, it runs directly into the two poles of the integrand! This 
situation is shown in Figure 10.31. Why did this problem arise, and how do we 
evaluate the integration near the poles? The answer to the first question is simple. 
We assumed the response was sinusoidal steady state over time and then carelessly 
took the Fourier transform over space. This spatial Fourier transform does not exist 
because the response to a sinusoidal steady-state source located at x = € has a finite 
amplitude for all x from minus to plus infinity. We can’t take the Fourier transform 
of such a function. There are two ways to fix this problem. One approach is to 
“chand-wave” your way to the correct answer using physical arguments. The other is 
to modify the problem, so we actually can take the spatial Fourier transform. Both 
will give the same answer. 

The hand-waving approach begins with the observation that what is being de- 
scribed by Equation 10.302 are the waves propagating to the left and to the right 
from a sinusoidal steady-state source located at x = &. We will make an attempt 
to connect the segments of the Fourier contour, shown in Figure 10.31, around the 
two poles. Then we will check to see if the result makes physical sense, given the 
physical situation. So let’s say the connections are made by small semicircles going 
above both poles, as shown in Figure 10.32. This contour can be closed in the upper 
half for x > &€ and in the lower half for x < & with the result that 


0 x> 
g(xlé) os [ico /2e)] — gi@olx—t)/co 4 er iente- Bea) x<€' (10.303) 


BOUNDARY CONDITIONS 401 


k-Plane 


Figure 10.32 First Guess for the Fourier Contour 


The first term comes from the pole at k = @,/co, and the second term comes from 
the pole atk = —w,/co. 

Remember the Green’s function is supposed to be the response to a 5-function 
source at x = &. If the sinusoidal steady-state time dependence is put back in the 
problem, then this solution would say that a source term of 


s(x, t) = Real[S(x — €)e!@'] (10.304) 
would give, according to Equation 10.303, a response of 
_ {0 x>é 
WD = 9 Real { [i/(2k,)] [—eiletthE-O) + eile bO-O]P gE? 
(10.305) 


where k, = w,/Co iS a positive constant. There is no response for x > &, and for 
x < € there are two wave responses, one from the k = w,/c, pole traveling to the left 
and one from the k = —w,/co pole traveling to the right, as shown in Figure 10.33. 
The only part of this solution that makes sense is the wave from the k = w,/co pole, 
traveling to the left, away from the source. The other wave in this region, x < €, is 
traveling toward the source, which makes no physical sense. It also makes no sense 
that there is no wave traveling to the right, away from the source in the region x > €. 

It is obvious, however, that all these shortcomings vanish if the “Fourier” contour 
is modified, as shown in Figure 10.34. Quotation marks are placed around Fourier 
because, strictly speaking, a Fourier contour exists only on the real axis. 


—No Response—— 


xX 


k =-@/c, wave k=@J/c, wave 


Figure 10.33. Response Using First-Guess Contour 
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k-Plane x>E 
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Figure 10.34 Proper Orientation of the “Fourier” Contour 


We can reach the same conclusion about how to modify the “Fourier” contour, a 
little more rigorously, if we introduce a damping term into the original differential 
equation. If the wave decays as it propagates, we will be able to legitimately take the 
Fourier transform over x. A simple modification of Equation 10.294 does the trick: 


= 5(x,1). (10.306) 


1 du(x,t) 97 u(x,t) du(x, t) 
ae + a—— 
cet” ax? at 


Now, if a is positive, the waves are attenuated as they move away from the source. The 
calculation proceeds as before, except now the damping term modifies the positions 
of the k-plane poles of Equation 10.302 and, for small positive a, places them as 
shown in Figure 10.35. There are no poles along the inversion contour, so everything 
evaluates easily. Then, finally, we can take the limit as a — 0 to see that the contour 
of Figure 10.34 is correct. In this limit, the poles in Figure 10.35 become arbitrarily 
close to the real k-axis, but the Fourier contour stays below the pole on the right and 
above the pole on the left. This is exactly the result of the hand-waving discussion 
that lead to Figure 10.34. 

So if we proceed with the inversion of Equation 10.302, using the contour of 
Figure 10.34 for a = 0, we obtain 


—|i/(2k,)| ew HOO x > E 
= [i/(2ko)] e tiko(x—€) x<é 


a(xlé) = { (10.307) 


Figure 10.35 Fourier Contour and Complex Plane Picture with Small Wave Damping 
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Source 


Figure 10.36 Undamped Waves Moving Away from Source 


When coupled with the sinusoidal time dependence, this solution behaves as shown 
in Figure 10.36. This solution describes undamped waves moving in both directions, 
away from the 6-function source located at x = &. 

For an arbitrary source S(x), the response is 


U(x) = i dé S(E)g(xlé) (10.308) 


\ 


‘ wt ikea 6) 4. 1 Tt ikla-8) 
[ es@xe | agg ee. 


The response to the original problem is obtained by coupling U(x) with the sinusoidal 
steady-state time dependence and taking the real part: 


u(x,t) = Real [Ucere'o']. (10.309) 


The Green’s function solution for this problem, Equation 10.308, is easier to 
understand if it is assumed that S(x) is pure real. Then 


s(x, t) = Real [see] 
= S(x) cos(ept), (10.310) 


and the response becomes 
" 1 
u(x,t) = / dé SMO sin [aot — k(x - é)] 
a dé Sz sin [wot + ko(x — €)]. (10.311) 


From this expression, it can be seen that the response is a sum of waves propagating 
away from the distributed source, toward the observation point x. 
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EXERCISES FOR CHAPTER 10 


1. Determine whether the following differential equations are homogeneous or 
nonhomogeneous, and linear or nonlinear: 


2 
= [ee] Oyj 2 =0 


” Gx |" ax d 
d [dy(x))? _ dy(x) _ 
il. £| oe 2y(x) x 


d?y(x) | dy(x) 

; +x +x 

dx? ax 
d*y(x) | dy(x) , 1 

god 

Vv. x27 +—=0 

Ne “dx y(x) 

2. The rate of evaporation for a constant-density, spherical drop is proportional 
to its surface area. Assume this evaporation is the only way the drop can lose 
mass and determine a differential equation that describes the time dependence 
of the radius of the drop. Solve this equation for the radius as a function of time, 
assuming that at ¢ = 0 the radius is ro. 


=0 


tii. x 


3. Using separation of variables, solve the differential equation 


dy(x) _ 2.2 
dx ay? 


with the boundary condition y(0) = 5. What happens when you try to impose 
the boundary condition that y(0) = 0? 


4. Consider the differential equation 


dy(x) __y 
dx x 

with the boundary condition y(1) = 1. 
(a) Solve this equation using separation of variables. 


(b) This equation can also be solved by converting it into an exact differential. 
Using the notation of Equations 10.21-10.24, identify the functions R(x, y), 
S(x, y), and @(x, y). Using the boundary condition, find a solution d(x, y) = 
C, that is equivalent to your answer in part (a). 


5. Consider the differential equation 


dy(x) * = cos (x”). 


dx 


(a) Determine an integrating factor for this equation. 
(b) Solve for y(x). 
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6. A resistor R and capacitor C are driven by a voltage source v(f), as shown in the 
circuit below: 


v(t) 


Cc q(t) 


The charge on the capacitor is given by the function q(t). For t < 0, the voltage 
source is zero and the charge on the capacitor is zero. The differential equation 
that governs the charge is 


ra + nat = v(t). 


(a) Find the integrating factor for this differential equation. 

(b) Using the integrating factor, express q(t) in terms of v(t). 

(c) Solve the equation of part (b) above if v(t) = 8(t). 

(d) Solve the equation of part (b) above if v(t) is a unit step applied at t = 0. 


7. Find the integrating factor and the general solution, with no specific boundary 
condition, for the differential equation 


1 dy(x) ‘ cos? x 


y(x) dx y(x) 


8. Consider the first-order differential equation 


dy(x) 
dx 


—xy(x) =x. 


(a) What is the integrating factor for this differential equation? 
(b) Find y(x) subject to the boundary condition y(0) = 0. 


9. Consider Bessel’s equation of order zero: 


2 
d*y(x) rf dy(x) 
ax? dx 


+ x y(x) = 0. 
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11. 


12. 


13. 


14. 


15. 
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Show that the Wronskian for this homogeneous differential equation has the 
form 


C 
W(x) = —, 
x 
where C, is a constant. 
What is the Wronskian associated with the differential equation 


d*y(x) dy(x) 
——— + ste = 

dx dx a= 
What is the Wronskian associated with the differential equation 


d*y(x) 
dx? 


+ y(x) sin (wx) = 0? 


Given that one solution of 


d*y(x) , 1 dy) _ 


dx? x dx x? I= 


is y(x) = x”, find the second solution. Using your result above, find a particular 
solution to the nonhomogeneous equation 


eye) , ge) _ 1 


Sh ae ae 


Use the method of Frobenius to find a series solution to 


d* d*y(x) dy(x) 
x? + 4x —— + = 
TE 4x x y(x) 
The Legendre equation has the form 
d*y(x) dy(x) 
— x2 a a 
(1 — x*) ; 2x + n(n + 1l)y(x) = 0 


Obtain the indicial equation used in the method of Frobenius to find a series 
representation for y(x). 


Show that the differential equation 


a a 
x oe + (1 - x) an ) + Ay(x) = 


has Frobenius polynomial solutions, as long as A is a nonnegative integer. De- 
termine the solution for A = 4 and with the boundary condition y(0) = 1. 
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16. The differential equation 


d?y(x) 
dx2 


—xy(x) = 0, 


possesses two independent solutions that are called the Airy functions. Use the 
method of Frobenius to evaluate these functions. What is the Wronskian of this 
differential equation? 


17. Using the method of Frobenius solve the differential equation 


Pe) d? d*y(x) oe) 


a tt 


+ y(x) = 0. 
18. Use the method of Frobenius to obtain two independent solutions to the equation 


2 d’y(x) y(x) 
dx ae 


2 oe 


2x + 2x + y(x) = 0 


Show that your solutions converge and prove they are independent. 


19. Use the method of Frobenius to generate a series solution for the differential 
equation 


dy(x) _ _ dy(x) 


dx3 * ax = 


Determine the generating function for the coefficients of the series and write out 
the first few nonzero terms. 


20. A tank car with a hole in the bottom is dumping its load so its total mass as a 
function of time is given by m(t) = m,(1 — aot), where a, and m, are constants. 
The car sits on a frictionless track and is attached to a spring with a spring 
constant Ky. 


X(t) 


The car is released att = 0 from x = 0 with an initial velocity v,. Assuming the 

spring is at equilibrium when the car is at x = 0, 

(a) what is the second-order, homogeneous differential equation that describes 
the motion of the car? 
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(b) is this a linear or nonlinear differential equation? 

(c) use the method of Frobenius to solve for x(t) over the interval0 <1 < 1 / Qo. 
(d) check the convergence of your solution. 

(e) what happens at t = 1/a,? 


21. The quantum mechanical interaction between two nuclear particles can be de- 
scribed by the one-dimensional interaction potential 


ew 


Develop a series solution for the wave function (x) that describes this system. 
V(x) satisfies the Schrédinger equation, 
Ww d*¥(x) 
2m dx? 


+ [E — V(x)} Vx) = 0, 
where £ is a constant representing the energy of the particle and m is its mass. 
Explicitly write out the first three terms of the series you obtain. 


22. Consider an undamped, undriven, frictionless mass spring system described by 
the homogeneous differential equation 


me) 
dt? 


+ Ky(t) = 0, 


where m is the mass and K is the spring constant. For this problem we will allow 
the spring to get stiffer as it is compressed. To accomplish this, let the spring 
constant K become dependent on the displacement such that 


K = Ky’, 


where K, is a positive constant. 


(a) At ¢ = O, the mass is released from x = x, with no initial velocity. Using 
the differential equation describing the motion, obtain an expression relating 
the velocity of the mass to its position. What is the acceleration of the mass 
when x = x,? 

(b) Make a graph of the velocity vs. position to show how the mass oscillates. 


23. A mass spring oscillator is set up with a magic spring, with a negative spring 
constant. It is frictionless and driven by an external force f(t) that is a function 
of time. The force from the spring on the mass is given by K,x(t) rather than 
— K,x(t). The position of the mass is x(t) and the initial conditions are that x = 0 
and dx/dt = 0 for all t < 0. 

(a) Write down the differential equation of motion for x(t), the position of the 
mass. 
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24. 


25. 


26. 


27. 


(b) Let the external force f(t) be given by 


So SiN Wot t>0 


so= 43 1<0° 


What is the Laplace transform of x(t)? 
(c) What is x(t) for large t? 


A driven, linear mass spring oscillator is described by the differential equation 
d?x(t) 
m9 + Kx(t) = f(t), 


where x(t) is the position of the mass as a function of time, and f(t) is the driving 
force 


oe (ER. 


For t < 0 the mass is at rest. 

(a) If x(t) has a Laplace transform X(s), what is the Laplace transform of 
d*x(t)/dt?? 

(b) What is x(t) for all ¢ > 0? 

A linear, frictionless mass-spring oscillator with mass m and spring constant K 

is at rest for t < 0 and is driven by an external force f(t). 


(a) If f(t) = 5(t), what initial conditions are established just after this 6-function 
force is applied? 


(b) If f(t) = dd(t)/dt, what are these initial conditions? 
(c) Finally, find the position and velocity of the mass for all ¢ = 0 if 


f(t) = ee t=0 


t<0¢ 


Solve the nonlinear differential equation 
1/2 47 y(x) 
[ya] ——— = 173, 
dx? 


given the boundary conditions y = 0 and dy/dt = 0 at x = x5. 


Find the unique solution for y(x) to the nonlinear equation 
d*y(x) _ 3 -1/3 
Sar Or 


with the boundary conditions y(0) = 0 and dy/dx|o = 0. 
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28. Use the method of quadrature to solve the equation 


d*y(x) 


z,2 ~ WO = 0, 


if y(0) = 1 and dy/dx|p = 1/2. 


29. Use the method of quadrature to solve the nonlinear equation 


d?y(x) _ 
dx? 


— cosy sin y, 
given the boundary conditions y(0) = 0 and dy/dx|p = 1. 
30. Use the method of quadrature to solve the differential equation 


d*y(x) _ siny 


dx? cos? y 


for y(x). Show, by substituting it back into the equation, that your solution works. 
Choose the constants of integration to give the simplest possible answer. 


31. This problem involves a Child—Langmuir solution for a two-component, anode- 
cathode gap. The geometry is described in the figure below. A large source 
of electrons exists at the cathode, and we assume the cathode temperature is 
sufficient for the electric field to be zero at t = 0. The same is true at the anode, 
except the particles there are positively charged ions. Let the electron have a 
mass of m— and a charge —q,, while the ion has a mass m, and charge +qp. 


Electron Ion 
cloud cloud 


Look for a steady-state solution such that 0/dt of all quantities are zero. Both 
electrons of charge density p_(x) and ions of charge density p+ (x) will be present 
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in the space between the electrodes, so the relevant Maxwell equation becomes 
Vv: €,E(x) = p+(x) + p(x). 


Use the method of quadrature to obtain a relationship between the current J, and 
the voltage V,. 


This problem is quite involved. Here are two hints to get you started: 
(1) The example worked out in this chapter arrived at the equation: 


ld [dd(QX)]* _ Jo [m . 17, db(x) 
55 (| €> ras (x) dx — 


The solution to this exercise works out best if an indefinite integral, with its 
constants of integration, is used to impose the boundary conditions, instead of 
the definite integral used in the example worked out in this chapter. Proceed 


as follows: 
1 do : = Jo m -1/2 
[4 3 (#) | ~ €\ 2Go [as 
PSO ro, ac 2 Te Pins ov 
1 (#8) 2k Frerre, 


The two constants can be combined 


1(db\?_ Jo [mip 
nee: ‘seeded = iene + > 
3 (#) eV 290° Co 


and the single constant C, evaluated by the boundary conditions, 


= do 
(0) = 0, Ge 


x=0 


For the example worked out in this chapter, C, = 0. For this exercise, this 
constant will not be zero. 
(2 


— 


In the chapter, we ended up with an integral in the form 


Vo 1 
i dd oe 


This integral was easy to perform. This problem does not work out as 
nicely, and the integral must be done numerically. For numerical integrations, 
variables are usually normalized so the results will be more general and can 
be easily scaled. For example, to normalize the above integral, we let 


(x) = Von(x), 
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33. 
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so that the normalized form becomes 


: 1 
3/4 
Ms Lf a al: 


The quantity in the brackets is referred to as a normalized integral because 
the variable 1 is dimensionless and the limits are from zero to one. The 
normalized integral for this exercise should come out to be 


a 1 
on ea 


Done numerically this integral has a value of about (4/3)+/ 1.86. 


For the string problem worked out in this chapter, we found that the Green’s 
function solution took the form 


x b 
yox) = [dk @entele + [dg FE erCae. 
Show that the derivative of y(x) can be expressed as 


dy(x) dgo(xlé) , dg, (xi§) 


= [ae 1) 


+ [des 2S 


For the string problem worked out in this chapter, the displacement of the string 
was governed by the differential equation 
d*y(x) 
T = . 
Fn f@) 


Assume the homogeneous boundary conditions y(0) = y(L) = 0, and use the 
Green’s function to find the displacement of the string if 


f) =F, [@-L/2y-1?/4) O<x<L. 


. Derive the Green’s function for the string problem again, using the boundary 


conditions 

(a) yO) = yi; y(L) = ee 

(b) y(0) = 0; dy/dx|; = 

(c) y(0) = 0; dy/dxl, = 

Show, with drawings, how i. string can be fastened at its end points to impose 
the three different boundary conditions described above. Why do the boundary 
conditions 


dy/dxl) = 0; dy/dxl, = 0 


cause problems with the Green’s function derivation? 
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35. Poisson’s equation can be viewed as a multidimensional form of the string 


36. 


problem. Two spatial derivatives of the dependent variable generate a function 
proportional to the drive 


Vor) = — Pe 
© € 


In this expression, P(F) is the electrostatic potential generated by a charge dis- 

tribution p(F). 

(a) Express a point charge in two dimensions using polar coordinates p and @. 

(b) Find the Green’s function for the two-dimensional Poisson’s equation using 
the polar coordinates. What boundary condition does this Green’s function 
satisfy? 

(c) Using your Green’s function, find a solution for ®(F) for an arbitrary p,(F). 

(d) Repeat the above steps for a three-dimensional solution using spherical 
coordinates. 

(e) Using your answer to part (d) above, find the solution for the potential of 
the dipole charge distribution shown below, where +q are point charges 
separated by a distance d. 


-q 


If time dependence is included in the description of an infinitely long, stretched 
string, the differential equation for its displacement becomes a wave equation 


a y(x,t) 1 A? y(x,t) _ 
ax? Ce a 
The string is stretched in the x-direction, and this equation describes perpen- 
dicular displacements in the y-direction. If there are no driving forces along the 
string, the differential equation is homogeneous, and excitations are established 
by the initial conditions at t = 0, which then propagate for t > 0. 
(a) Assume that the initial conditions are y(x, tf = 0) = yo(x) and dy/dt|,<o = 0. 
Find the Green’s function that allows the displacement of the string to be 
written as 


ye) = [ dé yo(é)g(xl€, 1). 
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(b) Plot your solution for the Green’s function vs. x for three values of f. 
(c) Find y(x, 1) if 


@= 1 ~-l<x<1] 
Yo) = 10 otherwise 


(d) Now find the Green’s function for the initial conditions y(x, ¢ = 0) = O and 
dy/dt\,-9 = Vo(x). Plot this Green’s function vs. x for three different values 
of t. : 


37. A linear, driven, mass-spring oscillator is acted upon by a standard — Kx spring 
force, an external force f(t), and a magic antifriction force a dx/dt where « is 
small, Assume the mass is at rest for t < 0 and its position is given by x(t). 


(a) Write down the differential equation of motion for this system. 


(b) Take f(t) to be zero for t < O and equal to F,sin(wot) for t > 0. Use 
Laplace transform techniques to obtain a solution for x(f) as tf — ~, 


(c) Sketch the Green’s function g(t ~ 7) vs. T. 
(d) If f(t) = ¢ fort > 0, what is x(t)? 


38. Consider the differential equation 


dr(t) 
dt3 


+r(t) = dO, 


where r(t) is the response to a drive d(t). The drive and response are assumed to 

be zero fort < 0. 

(a) What is the Green’s function for this differential equation? Can it be obtained 
with Fourier transform techniques? Can it be obtained with Laplace trans- 
form techniques? Discuss the initial conditions on r(¢) and its derivatives. 

(b) Sketch the Green’s function g(¢|7) vs. f. 

(c) If d(t) = t for t > 0, what is the response r(t)? 


39. Consider the differential equation 


d*r(t) 
7) r(t) = s(t), 

where r(¢) is the response to a source s(t), and the independent variable ¢ rep- 
resents time. Since this is a fourth-order differential equation, four boundary 
conditions are necessary to obtain a unique solution. For all parts of this problem 
assume that r(rt) and s(t) are zero for t < 0. 

(a) What initial conditions are established if s(t) = 5(t)? Assume these condi- 

tions are imposed just after the 5-function occurs. 
(b) Obtain an expression for the Green’s function g(¢|7) for this problem. 


(c) Make a plot of g(¢|r = 0) vst. 
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40. 


41. 


42. 


43. 


Find the Green’s function for the differential equation 
d*y(x) 
dx? 


over the interval 0 < x < 1, with the boundary conditions y(0) = y(1) = 0. 
Solve for y(x) if f(x) = sin (7x). 


+ y(x) = f(x) 


Consider the linear operator 


d? 
Lop = ae + 1 
and the differential equation 


Lop y(x) = f(x). 


For the initial conditions set y(0) = 0 and dy/dx|,-, = 0. 

(a) What is the Green’s function g(x|é) for this operator and what are its bound- 
ary conditions? 

(b) Obtain the Green’s function solution for y(x) as specifically as you can, 
while still leaving f(x) arbitrary. 

(c) Now set f(x) = 6(x — x,) and, using your answer to part (b), determine 
y(x). 

The equation of motion for a damped harmonic oscillator, driven by a force f(t), 

takes the form 


d*x(t) dx(t) _ 
m am + ae + Kx(t) = f(t) 
where the damping constant a is greater than zero, and K is the positive spring 
constant. Assume the drive and response are zero for t < 0. 
(a) Use Laplace transform techniques to obtain the Green’s function for this 
problem. 


(b) What is x(t) if 


F, 0<1<T, 
= o a9 
A) { 0 otherwise 


Consider the following circuit, where i,(t) is a current source, which is zero for 
t < 0. Assume that the response v,(t), which is the voltage across the capacitor, 
is also zero fort < 0. 


a. 


WTO ‘ gS ag) 
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(a) Obtain a nonhomogeneous differential equation which relates the response 
v,(t) to the drive i,(t). 

(b) Working in the time domain, determine the Green’s function for this problem. 

(c) Using your Green’s function form part (b), determine v, (t) if 


° 1,1? t>0 
Wo= {5 t<0°- 


(d) Now find the solution working in the frequency domain. Obtain the Laplace 
transform of the response V, (s) in terms of the Laplace transform of the drive 
1,(s). Use this result to obtain v,(t) for the drive given in part (c), above. Show 
that you obtain the same result as given by the Green’s function solution. 

(e) Show that the relationship between V,(s) and J ,(s) can be obtained by treating 
the complex impedance of the capacitor as 1/(Cs) and forming the parallel 
combination with the impedance of the resistor R. 

(f)} Using the complex impedance idea of part (e), what is the complex impedance 
of an inductor, L? What is the ratio V(s)/I(s) for the following circuit? 


i(t) 


+ 


L 


v(t) 
a an 
44. Consider the circuit, where v,(t) is a source voltage, and the current i, (t) is looked 
at as the response. 
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45. 


46. 


(a) Obtain the nonhomogeneous differential equation that describes the response 
i,(t) in terms of the source v,(f). 

(b) Working in the time domain, determine the Green’s function for this differ- 
ential equation. Assume that the source and response are zero for t < 0. 

(c) Using your Green’s function from part (b), determine i, (t) if 


Vot t>0 
w= {9° 1<0° 


(d) Now find the solution working in the frequency domain. Obtain /,(s), the 
Laplace transform of i,(f), in terms of V,(s), the Laplace transform of v,(¢). 
Use the same function for v,(t) as was used in part (c). Invert the Laplace 
transform for i,(f) to obtain the same answer you got using the Green’s 
function approach of part (c). 

(e) Show that by considering the impedance of the inductor to be Ls, the rela- 
tionship between /,(s) and V,(s) can be obtained by inspection. Notice that 
this is just the Laplace transform of the impulse response. 


Using the Green’s function technique, solve the differential equation 


d*y(x) 
dx? 


— By) = fi» 


over the interval 0 < x < L,, subject to the boundary conditions y(O) = y(L,) = 
0, using the following steps: 
(a) Show that the Green’s function for x < & is given by 


_ sinh (ky x) sinh (KoLy — kog) 
k, sinh (kL) 


g(xlé) = 


(b) Find the Green’s function for x > &. 
(c) Find y(x) in terms of f(x) using the Green’s function integral. 
(d) Find y(x) under the above conditions if 


f(x) = 58(x — 5) + 108(x — 10). 
(e) How does your answer to part (c) change if y(0) = O and y(L,) = 5? 
Consider the differential equation 


d dy(x) 
— Lao 


dx dx | ye) 


over the interval 0 < x < 1, with s(x) a known function of x. The solution y(x) 
is to satisfy the following boundary conditions 


y(0) > finite ; y(1)=0. 
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(a) What differential equation must be satisfied by 2(x|é), the Green’s function 
for this problem? 

(b) What is the general form of g(x|&) for x < € and x > €? 

(c) What conditions must g(x/) and its derivative with respect to x satisfy at 
x = &? 

(d) Using your results from part (c) and the boundary conditions, determine 
e(x/é) in the range 0 < x < 1. 

(e) Sketch g(xlé) vs. x for a fixed &, over the range 0 < x < 1. 


Find the Green’s function for the differential equation 


d?y(x) | dy(x) 
DE ey, = 
Pama +x —¥@) = d@) 
subject to the boundary conditions y(0) = y(1) = 0. The Green’s function must 
satisfy 


Yh d 
c pares | eal) = 8x ~ &). 


It may be easier, however, to solve for the Green’s function from 


BP ae at 1 
E dx* = dx ;| a(xlé) x oes) 
This problem involves the diffusion of magnetic field into an electrically conduct- 
ing material. The material is characterized by a constant electrical conductivity 
Oo, and magnetic susceptibility jz, and exists in the half-plane x > 0. Free space 
filis the region x < 0. 


The electric E and the magnetic B fields in the conducting material are governed 
by Maxwell’s equations in the diffusion limit 
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49. 


V xB = o,n,E 

vxE--8 
ot 

V-B=0 


Treat the material as if it were one-dimensional (i.e., all quantities have 0/dz = 0 
and d/dy = 0) and use these equations to obtain the diffusion equation 


a B(x, t) -. dB(x, t) 
ax? eet sor © 
Now consider the situation where the magnetic field is controlled in the free 
space Tegion so that B(x, ) = B,(t)é, for x < 0. A Green’s function solution for 


B(x, t) in the region x = 0 is sought, so that the solution for the magnetic field 
can be placed in the following form: 


Bix, 1) = K dt Bo(7)g(x, tlr)éy. 


(a) What differential equation and what boundary conditions at x = O must 
g(x, t|7) satisfy? 

(b) Determine G(x, w|7), the Fourier transform of g(x, t\7). 

{c) Describe the restrictions that must be placed on the function B,(t) in order 
for the above solution to be valid. 


Consider the one-dimensional wave equation 
a 1 & 
ox? 2. at? 


| r(x,t) = s(x, t), 


where the source term s(x, f) and the response r(x, t) are both zero fort < 0. 


(a) Obtain the Green’s function g(x|é, t|7) for this problem that allows the re- 
sponse to be written as 


ran = ag | dt s(€, T)g(a1€, t|7). 


(b) Plot g(x], ¢|7) vs. x for three different values of t, one value less than 7 and 
two values greater than T. 


Note that if f(x) is a unit step from —a < x < a, the Fourier transform of f(x) 


18 
are /2 sin (ka) 
T k 
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50. In the chapter, we derived the Green’s function for the wave equation in Equation 
10.307. 
(a) How is g(xlé) modified if we have the additional boundary condition 
g(x = O|€) = 0? 
(b) Make a plot of this Real[g(xlé)e““*’] for several different times—as many as 
are needed to show its character. 
(c) What physical situation does this new boundary condition represent? 


51. Infree space, electromagnetic waves propagate according to Maxwell’s equations 


V x Ee, 1) = ~ eee) 
ot 
V x A) = + SAD. 


which can be combined to give the wave equation 


4 F Hoot) _ 


52 0. 


-VWEG, 1) 


If, instead of free space, the wave is propagating in a dielectric medium that is 
spatially dependent, i.e., 


€y9 — € + €(F), 
(a) Show that the wave equation becomes 


2” MoSoE(F,t) | FP Hoe(MEC,1) _ 5 


_~U2Riz + 
WEE) or or? 


(b) Consider a wave propagating at a pure real frequency w,, so that E(F, t) — 
Er) e’“°’. This is like taking a Fourier transform in time. Show that the wave 
equation becomes 


—VE(P) — 02.6 .E(F) = +2 poe(FE(P). 


Now let e(F) = &(x) to model a sheet of infinite dielectric in the x = 0 plane. 
Assume there is no variation of the electric field in the y-direction, and that the 
spatial dependence of the field can be expressed as 


Bre — ik, x ike2 4 
E® = E,e** e%8,. 


This is like taking Fourier transforms in the two spatial directions. 

(c) Find k, and k, for x > Oand x < 0. 

(d) Can this solution for the electric field be used as a Green’s function for 
finding the solution for an arbitrary e(x)? 
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53. 


54, 


This problem looks at the diffusion of heat along an infinite rod with the temper- 
ature at x = 0 controled for all time, 


T(x =0,) = a oe 


The standard diffusion equation applies 


a°T (x,t) aT (x,t) 
Pee NS) 
Ox? ot 


> 


which, with the above boundary condition and the assumption that T(x, ft) — 0 
as |x| —» ©, uniquely determines T(x, t) for all x and t > 0. The solution for 
T (x, t) can be obtained from a Green’s function integral 


T(x,t) -[ dt f(t) g(x, t{r). 
0 


(a) Identify the differential equation and boundary conditions that g(x, t|7) must 
satisfy. 

(b) Determine g(x, t]7). The following Laplace transform will probably be of 
help: 


ps e P14) Le kVS (kk > 0). 


2V mB 


A differential equation is said to be in Sturm-Liouville form if it can be written 
as 
d 
dx 


dy(x) 
dx 


loos + q(x) y(x) = s(x), 

where a(x), b(x), and s(x) are known functions of the independent variable. 
All linear, second-order differential equations can be placed in this form. This 
particular form has many advantages, as will be shown in the next chapter. The 
differential equation 


bye) dy(x) 
dx? dx 
can be placed in Sturm-Liouville form by multiplying both sides of this equation 
by a function f(x). 
(a) What f(x) will accomplish this? 
(b) What is the final Sturm-Liouville form for this equation? 


+ (x? cos x + 2x) + x y(x) = d(x) 


If a particular differential equation is not in Sturm-Liouville form (see Exercise 
10.53), the Green’s function for the equation in the non-Sturm-Liouville form 
will be different from the Green’s function for the equation in Sturm-Liouville 
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form. Consider the differential equation: 


2 PO 4 2 PE _ yyy = av, 

with the boundary conditions y(0) = y(1) = 0. 

(a) What is the Green’s function for this problem as described by the differential 
equation given above? 

(b) Place the differential equation in Sturm-Liouville form. 

(c) What is the Green’s function for the differential equation in Sturm-Liouville 
form? 

(d) Formulate the response of the system to a drive d(t) using the Green’s 
functions obtained for parts (a) and (c) above and show that these responses 
are identical. 


55. In this chapter, we have seen cases where the Green’s function was symmetric 
with respect to x and &. That is to say, 


B(alé) = g(élx). 


In this problem you are being asked to show the general conditions that are 
necessary for this symmetry to hold. Prove that if the differential equation is 
in Sturm-Liouville form (see Exercise 10.53) and the boundary conditions are 
homogeneous, the Green’s function will always have this symmetry. 

To show that this is the case, begin by writing the differential equation that 
g(x]é) satisfies for two different values of €: & and &. Multiply the first by 
g(x|&) and the second by g(x|é,) and subtract to obtain: 


| 


d 
— g(xléi)— ak [pw 


eel) 5 [oc ) | 


= g(xl&)5(x — &:) — gl&i5x — &). 
Show that this expression becomes 


0) - 3 da Sle 


Glé)px | 


5 [etleopes ) 


= g(xl&)5(x — &:) — g(xlé)6(x — &). 


Now if the solution is to be valid over the range x; < x < x2, and the boundary 
conditions are such that the function or its derivative must go to zero at x = x 
and x = x2, show that the above equation can be integrated to give 


&(Glé) = e(&1&). 
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56. Using the steps outlined in the previous problem, determine the symmetry prop- 
erties of the Green’s function that satisfies the differential equation 


d?y(x) | 9 dy(x) 


3 
x x 
dx? dx 


— xy(x) = d(x), 


with the boundary conditions y(0) = y(1) = 0. Note that this differential 
equation is not in Sturm-Liouville form. 


11 


SOLUTIONS TO 
LAPLACE’S EQUATION 


It would be impossible to discuss all the different types of differential equations 
encountered in physics and engineering problems. In this chapter, we will concentrate 
on just one, Laplace’s equation, because it occurs in so many types of practical 
problems. In vector notation, valid in any coordinate system, Laplace’s equation is 


V-@ = 0. (11.1) 


We will investigate methods of its solution in the three most common coordinate 
systems: Cartesian, cylindrical, and spherical. The solution methods we present here 
will also be applicable to many other differential equations. 


11.1 CARTESIAN SOLUTIONS 


In Cartesian coordinates, Laplace’s equation becomes 


oe e 


eo 
Be say? + re; P(x, y,z) = 0. (11.2) 


V7@(x, y,2) = | 


A standard approach to solving this equation is to use the method of separation of 
variables. In this method, we assume P(x, y, z) can be written as the product of three 
functions, each of which involves only one of the independent variables. That is, 


P(x, y,z) = X(X)Y(y)Z(@). (11.3) 


It is important to note that not all the possible solutions of Laplace’s equation in 
Cartesian coordinates have the special separated form of Equation 11.3. As it turns 
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out, however, all solutions can be written as a linear sum (of potentially infinite 
number of terms) of these separable solutions. You will see examples of this as the 
chapter progresses. Using this form for (x, y, z), Laplace’s equation becomes 


1 6X 1 3Y rz 
ae 2 Fee ee (11.4) 
X(x) ax? ¥(y) dy? Xz) 2? 
Each of the three terms on the LHS of Equation 11.4 is a function of only one 


of the independent variables. This means that, for example, the sum of the last two 
terms 


1 PY(y) | 1 6*Z(2) 


Y(y) dy? Z(z) dz? Gh?) 


cannot depend on x. A quick look back at Equation 11.4 shows that this implies the 
first term, itself, also cannot be a function of x. It obviously is not a function of y or 
z either, so it must be equal to a constant: 


1 6?X(x) 
= Ey. 11.6 
Xa) ae * ee 

Similar arguments can be made to isolate the other terms: 

1 #Y(y) 
= =c (11.7) 
Y(y) dy? 

2 
1 d*Z(z) _ (11.8) 


7) 


The three quantities c,, cy, and c, are called separation constants. Notice that they 
are not completely independent of one another, because Laplace’s equation requires 


Cy tcy +e, = 0. (11.9) 


Consequently, only two can be chosen arbitrarily. 


11.1.1 Solutions of the Separated Equations 


Each of the three separated equations (Equations 11.6-11.8) is the same linear, 
second-order differential equation. The solution to Equation 11.6, for a particular, 
nonzero value of c,, is simply 


X(x) = XsetV%* + X_e Vo" cx #0, (11.10) 


where X, and X_ are arbitrary amplitudes, which are determined only when we apply 
the boundary conditions. When this general solution is combined with Y (y) and Z(z), 
these constants combine with the corresponding Ys and Zs constants to form other 
constants. For this reason, we will ignore these amplitude constants until the final 
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form of P(x, y, z) is constructed and initially just concentrate on the functional form 
of the separated solutions. Equation 11.10 will be written, using a shorthand notation, 
as 


+ Ox 
XQ) > oan c, #0. (11.11) 
e P 4 


We now turn our attention to the solutions for X(x) with different separation 
constants. The constant c, may be negative and real, positive and real, or complex. 
Because the solution depends on the square root of c,, it turns out the results of a 
complex c, are covered by a combination of solutions generated by real positive c, 
and real negative c,. Thus, we will limit our attention to real values of c,. There is 
also the special case of c, = 0 to consider. 

In the case with c, = 0, the differential equation becomes 


1 d?X(x) 
—— —_— =0, 11.12 
X(x)_ dx? o ( ) 
and the general solution has a special form 
X(x) = Ax + B, (11.13) 


where A and B are arbitrary amplitude constants. In our shorthand notation, this is 
written as 


X(x) > e c, = 0. (11.14) 


This is qualitatively very different than the form of Equation 11.11. Often these 
special linear solutions can be ignored because of some physical or mathematical 
reasoning, but not always, and one must be careful not to forget them. 

If c, is real and greater than zero, we can write Vex = +a where a is real and 
positive. The solutions for X(x) take the form of growing or decaying exponentials 


+ax 


X(x) > ee Cc, =a? >0. (11.15) 


These solutions are shown in Figure 11.1. Often, it is convenient to rearrange things 
and use hyperbolic functions instead: 


sinh(ax) 


are) 
ciehieex) cy =a’ > 0. (11.16) 


X(x) > { 


The two forms in Equations 11.15 and 11.16 are completely equivalent. Convenience 
dictates which set is the better choice. Some boundary conditions are more easily 
accommodated by the exponential solutions, and others by the hyperbolic functions. 
The sinh function has odd symmetry and is zero at x = 0. The cosh function has 
even symmetry, and is one at x = 0. These functions are shown in Figure 11.2. If a 
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Figure 11.1 Exponential Solutions to Laplace’s Equation in Cartesian Geometry 


problem naturally has odd or even symmetry, the hyperbolic functions are usually a 
wise choice. 
If c, is a real negative number, its square root is imaginary. That is, \/c, = ik with 
k > 0. Therefore, for a negative separation constant, the solutions are oscillatory: 
etikx , 
X(x) 3 {ous c= -P <0. (11.17) 


For solutions that must be pure real functions, this form is usually clumsy, because 
the amplitude constants need to be complex. Equation 11.17 can be rewritten using 
sin and cos functions: 


sin(kx) 42 
X(x) > eae Cy = —k* <0. (11.18) 


Again, the choice between the two forms is one of convenience. 

In summary, the separated solutions to Laplace’s equation in Cartesian geometry 
have three possible forms. If the separation constant is zero, the solution has the 
linear form Ax + B, if the constant is negative, the solution oscillates, and finally, if 
the constant is positive, the solution is an exponential. Notice, because the separated 
differential equations for Y(y) and Z(z) are identical to the equation for X(x), their 
solutions follow a similar pattern. An interesting condition is imposed on these 
three solutions by Equation 11.9. If the separated solution in one direction has an 


Cosh(ax) 


Figure 11.2 The Hyperbolic Solutions to Laplace’s Equation in Cartesian Geometry 
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exponential form, at least one of the other solutions must have an oscillatory form 
and vice versa. We will discover that this is a general trait of the separated solutions 
in other coordinate systems as well. 


11.1.2. The General Solution 


The solution for P(x, y, x) is the product of the three separated solutions, as described 
by Equation 11.3. The amplitude and separation constants are determined by a set of 
boundary conditions. These boundary conditions may allow a particular separation 
constant to take on more than one value and, in many cases, an infinite set of 
discrete values. In such a situation, because Laplace’s equation is linear, the general 
solution will be composed of a linear sum over the acceptable values of the separation 
constants. 

Let the possible nonzero values of c, be indexed by m. That is, the mz" allowed 
separation constant for x is Cyn. Likewise, let cy, be the n” nonzero separation 
constant for the y variable. Equation 11.9 implies that c, cannot be independently 
specified, because c, = —Cy_ — Cy,- Using the shorthand notation, we can write the 
general solution as a sum over the two indices n and m: 

+ Senex +. femyy +f Cn —CmyZ 
P(x,y,2) = >> i ax ie ey —— (11.19) 

nym 


+ The Special Linear Solutions. 


The sum on the RHS of this equation is a shorthand notation for writing all the 
combinations of solutions that have nonzero separation constants, each with an un- 
determined amplitude constant. For each value of m and n, there are eight possible 
combinations of the solution types. If you wanted to expand this out, it would look 
like this: 


Amne? Vom et Vem et Jf Ens = CmyZ 
+ Bmne Vint et Sem) ot (Ena Cry? 
+ Cnet Vier ee” Vom ot fen Cmye 
+Drne VonX e~ Vem ot /~ ens Cyt 
DDL) a Bet Vinig VI g/m (11.20) 
+ Fiane Ve * Vom ~~ Ena Eye 
+ Gone? Vem eo Vem gp /~ Cn Cm? 
+ Hane Vom e~ Vem oem Emyz 


Many times the symmetry of the problem will eliminate some of the terms in the 
bracketed sum of Equation 11.20, and those particular amplitude constants will be 
zero. For this reason, it is convenient to use the shorthand notation of Equation 11.19 
and not address the amplitude constants until the symmetry arguments have placed 
the general solution in more tractable form. 
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The “special” solutions of Equation 11.19 are the ones that fall outside of normal 
‘orm because one or more of the separation constants is zero. For example, if a 
sossible solution has c, = 0, but c, = c and c, = —c, there is a special set of linear 


solutions 
1 et vey etVv—-a 
tt ee ’ (11.21) 


When c, = cy = c, = 0, another set of special solutions is 


1 fl fi 
e : {: ‘ (11.22) 


The general solution will be unique, that is to say all the amplitude and separation 
2onstants will be determined, only if enough boundary conditions are specified. For 
Laplace’s equation this means that the solution will be unique in a region if the value 
of B(x, y, z) or its normal derivative, or a combination of P(x, y,z) and its normal 
derivative is specified over the entire surface enclosing that region. These are referred 
to as the Neumann or Dirichlet boundary conditions, respectively. 


Example 11.1 To see how the boundary conditions are used to determine the con- 
stants of Equation 11.19, consider the two-dimensional problem depicted in Fig- 
ure 11.3. Two parallel plates exist in the planes y = +h and y = —h, with periodic 
Square wave electric potentials applied as shown. The problem is two dimensional, 
because there are no variations in the z-direction. The potential along the x-direction 
repeats with a spatial period of 2d. The boundary conditions over the first period can 
be summarized as 


+1 O<x<d 
P(x, +h) = Be eee (11.23) 
y 
@(x,+h) +1 4 | Ps | Si 
2d 
»¢ 
@(x,-h) -1 +1 | +1 ~1 Ho! 
| 


Figure 11.3. Two-Dimensional Laplace Equation Problem with Square Wave Boundary Con- 
ditions 
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and 


_f-l O<x<d 
B-= {51 ea (11.24) 


We want to find the solution for the potential between the plates, using the boundary 
conditions and the two-dimensional form of Laplace’s equation: 


2 2 
GPx, y) | PHY) _ 9 (11.25) 
ax2 dy? 

Based on our previous discussion, we will look for a two-dimensional solution with 
the form 


D(x, y) = X(x)¥(y). (11.26) 


A quick look at the nature of the boundary conditions shows that the solution of X(x) 
must be oscillatory in order to satisfy the periodic nature of the problem. Therefore, 
it must use a negative, real separation constant: 


1 @X@) _ pp 


ay de (11.27) 


where we have put the constant in the form of a square to make the subsequent algebra 
simpler. The other solution must therefore obey 


1 @¥0) 


= +k’, : 
YG) dy? k (11.28) 


The possible solutions to Equation 11.27 are constrained by two facts. First, 
because the boundary conditions of the problem have odd symmetry about x, X(x) 
must also have odd symmetry. Also, because the boundary conditions of the problem 
have a periodicity of 2d, each solution of X(x) must oscillate with a period of 2d/n 
where n is any positive integer. This ensures X(x + 2d) = X(x). All the possible 
solutions of X(x) can be written using 7 as an index, 


X(x) — sin(k,x), (11.29) 


where k, = am/d. Notice the k = 0 solution is not allowed, since the corresponding 
solution of X(x) = Ax + B would violate the combination of the symmetry and 
periodic requirements. This means we do not need to consider the special linear 
solutions we mentioned in the previous section. For the Y(y) solution, we will use 
the sinh function, because it matches the odd symmetry of the boundary conditions 
in the y-direction: 


Y(y) — sinh(k,y). (11.30) 
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The combination of these two solutions generates the general solution for B(x, y), 


ao 


@(x,y) = > Ansin(knx) sinh(kny), (11.31) 


n=1,2,... 


where the amplitude constants A,, are finally introduced. 

The A, coefficients are determined by requiring Equation 11.31 go to the applied 
potential on the boundaries. Since we have already imposed the odd symmetry in 
the y-direction, if the expression in Equation 11.31 goes to the proper potential at 


y = +h, it will automatically go to the correct value at y = —hA as well. Thus to 
make everything work, the solution must satisfy 
= . (mmx\ ., f(mh\ [+1 0<x<d 
»» anon (= *) sian (=) = es d<x<2d° at3?) 


The LHS of Equation 11.32 is in the form of a Fourier sine series. A particular 
coefficient A,, can be obtained by operating on both sides of this equation with 


[ow sin (=). (11.33) 


Every term in the series drops out except the n = m term, giving the result: 


0 m even 
Am = 4 ; (11.34) 
penne: 


The complete solution to the problem is then 


_ 4 . (AMX, . niry 
P(x, y) = pss Berar sin ( : ) sinh (=). (11.35) 


A plot of the first 21 terms of the RHS of Equation 11.35, for the region —2d < 
x < 2d and ~h < y < h, is shown in Figure 11.4. Notice the Gibbs overshoot 
phenomenon occurring at the discontinuities. 


Example 11.2 The previous example was particularly simple, because the periodic 
boundary conditions immediately forced one of the separated solutions to have an 
oscillatory form. The boundary conditions shown in Figure 11.5 are not as obliging. 
This is again a two-dimensional problem, but now we are looking for the solution to 
Laplace’s equation in the interior of an a X b box. On the two vertical surfaces, the 
potential is required to be zero, while on the top and bottom surfaces, ® = +1. In 
this case, it isn’t obvious that either X(x) or Y(y) needs to have an oscillatory form, 
because none of the boundary conditions are periodic. We can use a common trick, 
however, and attempt to convert this problem into an equivalent periodic problem. 
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a8) \ 


Figure 11.4 The Electric Potential Distribution for Square Wave Boundary Conditions at 


P=! 


@=-1 


Figure 11.5 Square Box Boundary Conditions 


We will try to change the problem into the form shown in Figure 11.6. The position 
of the original a X b box is indicated by the shaded region shown. The top and bottom 
boundaries of this shaded region must be fixed at ® = +1 so they match the original 
boundary conditions. We need to apply periodic conditions along the rest of the top 
and bottom surfaces, where the ? marks appear in the figure, to try to force the vertical 


pa ee I i oe 
| a 
pe Pp = ? 4 
b 


Figure 11.6 Attempt to Convert to a Problem Periodic in the x-Direction 
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®=1 o=-1 =! @=-1 © =1 
Vl 
a 
O=-1 =i oe-4 O=l o=-1 
fDi da 


Figure 11.7 Conversion to a Periodic Problem in the x-Direction 


sides of the shaded region to have ® = 0. Looking back on the previous example, 
it is pretty easy to see that these requirements are met with the potential distribution 
shown in Figure 11.7. The solution inside the box is therefore identical to the solution 
to the previous example with the x-periodicity equal to 2b. The only difference is, of 
course, that this solution is only valid inside the region of the original a X b box of 
Figure 11.5. 

We successfully made this problem periodic in the x-direction. Could we have 
done the same thing in the y-direction? Figure 11.8 shows what this would look like. 
Again, we need to apply periodic potentials to the areas where the ? marks occur in 
the figure, to try to force the horizontal sides of the shaded region to have ® = +1. 
It’s pretty obvious this cannot work. In this case, the x-direction must be chosen as 
the periodic one. 

An interesting twist is added to this problem if we modify the boundary conditions, 
as shown in Figure 11.9. In a case like this, you will find it is impossible to simply 
make either the x- or y-direction periodic. Instead, the problem must be broken up 
into two parts, as shown in Figure 11.10. Because Laplace’s Equation is linear, the 
solution can be written as the sum of two solutions, ®;(x, y) and ®(x, y), which 
each satisfy modified boundary conditions shown for the two boxes on the right. The 
function ®, (x, y) satisfies Laplace’s equation and has boundary conditions which are 
amenable to periodic extension in the x-direction. The function P2(x, y) also satisfies 
Laplace’s equation, but with conditions which can be extended in the y-direction. 
The combination ®,(x, y) + ®2(x, y) = P(x, y) satisfies Laplace’s equation inside 
the box and satisfies the original boundary conditions of Figure 11.9. 


11.2. EXPANSIONS WITH EIGENFUNCTIONS 


Before looking at solutions to Laplace’s equation in other coordinate systems, we need 
to establish some general techniques for expanding functions with linear summations 
of orthogonal functions. In the Cartesian problems we have discussed so far, we found 
that we could evaluate the individual amplitude constants in the general solution by 
using Fourier series type orthogonality conditions. This worked because the terms in 
our sums of the general solution contained sine and cosine functions. In this section, 
we will generalize this technique for other problems. We will begin by reviewing the 
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Figure 11.8 Improper Conversion to a Periodic Problem in the y-Direction 


Figure 11.9 Compound Cartesian Boundary Conditions 
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Figure 11.10 Dividing the Compound Problem into Two Periodic Problems 


2igenvalue and eigenvector concepts associated with the linear algebra of matrices, 
with which the reader is assumed to be familiar. These ideas will then be extended to 
the eigenvalues and eigenfunctions of linear differential operators. 


11.2.1 Eigenvectors and Eigenvalues 


Eigenvectors and eigenvalues are concepts used frequently in matrix algebra. A matrix 
[M] is said to be associated with a set of eigenvectors [V,,] and eigenvalues A,,, if 


In other words, a vector is an eigenvector of a matrix if premultiplying it by the 
matrix results in a constant times the original vector. The proportionality constant is 
the eigenvalue. Given the matrix [M], the eigenvalues are found from the condition 


I[M] — [1)Anlaer = 0. (11.37) 


The eigenvectors, to within an arbitrary constant, can then be found by inserting the 
2igenvalues into Equation 11.36 and solving for the components. A 3 X 3 matrix will 
have three eigenvalues. If the eigenvalues are all different, there will be three distinct 
eigenvectors. If there is some degeneracy in the eigenvalues, then the eigenvectors 
associated with these degenerate eigenvalues will not be uniquely defined. These 
ideas were covered in detail in Chapter 4. 

A matrix (M] is Hermitian if 


M;; = M’. (11.38) 


That is to say, a matrix is Hermitian if it is equal to its complex conjugate transpose. 
If a matrix is Hermitian, its eigenvalues are always pure real, and its eigenvectors are 
orthogonal. 


11.2.2 Eigenfunctions and Eigenvalues 


These ideas can be extended to operators. Let £,,(x) be a linear differential operator. 
In direct analogy with the matrix definition, the set of eigenfunctions ¢,(x) associated 
with this operator satisfy the condition 


Lop(X) Pa(x) = Andn(x). (11.39) 
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In other words, when the operator acts on any of its eigenfunctions, the result is just a 
constant times the eigenfunction. The constant A,, is still referred to as an eigenvalue. 

Actually, it is convenient to extend this definition a little further, to include what 
is called a weighting function. The eigenfunctions of an operator with a weighting 
function obey the equation 


Lop(x) Gn(x) = Anw(x) n(x), (11.40) 


where w(x) is any real, positive definite function of x. Positive definite is a fancy way 
of saying w(x) > 0 for all x. Our original definition in Equation 11.39 is a just special 
case where w(x) = 1. 


11.2.3 Hermitian Operators 


An operator £,,(x) can be combined with a set of basis functions u,,(x) to form a set 
of matrix elements 


bij = [he wie oC (11.41) 


The functions u, (x) may or may not be the eigenfunctions of the operator. In general, 
these functions are complex, as is the operator itself. The matrix elements generated 
by this integration can also be complex. The integration is done over some range of 
the variable x that we label with the symbol 2. 

Following an analogy with matrix algebra, a linear operator is Hermitian if 


Lj = Lis (11.42) 
or equivalently 
is dx Ut (x) Lop (x) = A dx u,(x) [Lopu;(x)]" . (11.43) 


In general, satisfying the Hermitian condition of Equation 11.43 depends not only 
on the operator £,,, but also on the functions u,(x) and specifically the nature of 
their boundary conditions at the limits of 0. Most often, it is convenient to use the 
eigenfunctions of the operator for the u,,(x), but in some cases other functions are 
used. 


11.2.4 Eigenvalues and Eigenfunctions of Hermitian Operators 


The eigenvalues and eigenfunctions of a Hermitian operator have three very useful 
properties. First the eigenvalues are always pure real. Second, the eigenfunctions 
will always obey an orthogonality condition. Third, the collection of eigenfunctions 
usually forms a complete set of functions. That is, any function can be expanded as 
a linear sum of the eigenfunctions. 

To prove the first two of these assertions, let @,(x) be the n” eigenfunction of 
the Hermitian operator £,,, with a corresponding eigenvalue of A,,. We will use the 


EXPANSIONS WITH EIGENFUNCTIONS 437 


extended version of the eigenvalue/eigenfunction condition that includes a weighting 
function: 


Lop dn(X) = Anw(x)dn(x). (11.44) 


Remember this weighting function is areal, positive definite function. If the Hermitian 
condition of Equation 11.43 is satisfied, then Equation 11.44 can be substituted into 
this condition to give 


I de 3 (x)Anw(x)ba(x) = i dx bol z)MWO)OS(x), (11.45) 

which, on rearrangement, becomes 
Opa) a dx bh (xxl) = 0. (11.46) 
This equation contains some very useful information. If m = n, the quantity 


5 (x)w(x)ba(x) = |bn(x)|?w(2) is positive definite, so 
| dx op (x)w(x)dn(x) # 0. (11.47) 
a 


Consequently, 
An = Aj. (11.48) 


This shows the eigenvalues of a Hermitian operator are real. In addition, if m # n 
and A, # Am, the only way Equation 11.46 can work is if 


| dx 6, (x)w(x)obn(x) = 0. (11.49) 
0) 


Equation 11.49 is an orthogonality condition. Notice the range of the integral Q. is 
the same range used in Equation 11.43 to produce the Hermitian condition. 

If two or more eigenfunctions have the same eigenvalue, that is, if A, = Amn, 
the line of reasoning which leads to Equation 11.49 is not valid. We can, however, 
always arrange our choices for the degenerate eigenfunctions so that Equation 11.49 
still holds. This is directly analogous to the case of degenerate eigenvalues in ma- 
trix algebra, discussed in Chapter 4. The general method for constructing linearly 
independent, orthogonal eigenfunctions that have degenerate eigenvalues is called 
the Gram-Schmidt orthogonalization process. You can work out an example of this 
process in one of the exercises at the end of the chapter. 

We also claimed that the eigenfunctions of a Hermitian operator will usually form 
the basis of a complete set of functions. Completeness is a difficult property to prove 
and beyond the scope of this text. It can be shown that the eigenfunctions of a Sturm- 
Liouville operator always form a complete set. These types of operators are discussed 
in the following section. 
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The consequences of having a complete set of orthogonal eigenfunctions are 
enormous. If this is the case, we have a method for expanding arbitrary functions in 
terms of these eigenfunctions. Imagine we wanted to expand the function s(x) using 
the eigenfunctions of an operator that obeys Equation 11.44. If the eigenfunctions 
are complete, we can write 


s(x) = So Spdn(x), (11.50) 


where the S,, are (possibly complex) coefficients that we need to determine. To isolate 
the m’* coefficient, multiply both sides of this expression by w(x) (x) and integrate 
over the full range: 


ip dx s(x)w(x)@*(x) = [ dx » sso w(x), (x). (11.51) 
Q a : 


The orthogonality condition will force every element of the sum on the right-hand 
side to vanish, except the » = m term. That single remaining term gives an expression 
for S,,,: 


_ J dx 5(x)w(0)O7,@) 
fo dx bm (HW) H 3H) (11.52) 
11.2.5 Sturm-Liouville Operators 


We will now prove that the Hermitian condition always holds for the so-called Sturm- 
Liouville operators, if a special set of boundary conditions is imposed on the functions 
used in the Hermitian integration. An operator is in Sturm-Liouville form if it can be 
written as 


d d 
Lop = ae loo + q(x), (11.53) 


where p(x) and g(x) are two real functions. This is not a very restrictive condition, be- 
cause the differential operations in all second-order linear equations can be converted 
into this form by multiplying the differential equation by the proper function. 

To see how the Sturm-Liouville form of Equation 11.53 leads to the Hermitian 
condition, substitute this operator into the LHS of Equation 11.43 to obtain 


* d d * 
I ax u;(x)— | p(x)——u,(x)} + i dx u;(x)q(x)u;(x). (11.54) 
a dx dx rey 
Likewise, the RHS of Equation 11.43 becomes 


d d , * 
ib ax uj(x) ores o) + I ax uj (x)q(x)u; (x). (11.55) 
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The last terms of both these expressions are obviously the same, so to prove Equa- 
tion 11.43, we only need show the equality of the first two terms. Integrating these 
terms by parts and equating them gives 


[were 2] p dx Oy HO) 
= [erp HO] - fas SH) 9) (11.56) 


The integrals in Equation 1 1.56 cancel, so if we limit our possible choice of functions 
to those that satisfy 


—2—|| =0 157 
de ( ) 


juts Ha] 


for all possible choices of u,,(x) and u,(x), the Hermitian condition will always be 
satisfied. ; 

It is important to note that an operator that is not in Sturm-Liouville form can still 
be Hermitian. A good example of this is the operator £ = —id/dx, which you can 
easily show is Hermitian when appropriate boundary conditions are imposed on its 
eigenfunctions. We presented the proof for Sturm-Liouville systems here for several 
reasons. First, all the second-order, separated differential equations that are used to 
solve Laplace’s equation can be put into Sturm-Liouville form. In addition, although 
we will not prove it here, if an operator is in Sturm-Liouville form, its eigenfunctions 
form a complete set of functions. The proof of this fact can be found in Griffel’s 
book, Applied Functional Analysis. Consequently, a linear sum of these complete, 
orthogonal solutions can be used to satisfy any set of well posed boundary conditions. 


Example 11.3. Consider the simple differential operator 


d2 


a (11.58) 


Gee 


For the simplest weighting function w(x) = 1, the eigenfunctions and eigenvalues of 
this operator satisfy the equation 


d2 
De G(X) = Anda(x). (11.59) 

Xx 
We already encountered this equation when we separated Laplace’s equation in 
Cartesian coordinates. Recall, there were several possible forms for the general 


solution of Equation 11.59. One form was 


n(x) = A,e*V** + Bie Vr, (11.60) 
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where A,, and B,, are complex amplitude constants. Since we have not yet imposed any 
other requirements beyond the differential equation, the eigenvalues and amplitude 
constants are undetermined. 

What happens if we check to see if £,, is Hermitian? First, notice that this operator 
is in the Sturm-Liouville form of Equation 11.53, with p(x) = 1 and q(x) = 0. This 
means the Hermitian condition is satisfied, using the eigenfunctions as the basis 
function, as long as the ¢,(x) satisfy the condition of Equation 11.57: 


b 
dom = 0. (11.61) 
dx 


Pn 


Notice that the range of integration 0 has been defined by the two end points x = a 
and x = b. The easiest way to satisfy this expression is to require the ¢,(x) or their 
first derivatives be zero at these endpoints. The only way to accomplish this is to make 
¢,(x) oscillatory, so the A,, must be real, negative numbers. If we define ik, = J An» 
this puts Equation 11.60 into the form 


dn(x) = Ane + Biew (11.62) 


If we specifically force $,(a) = 0 and ¢,(b) = 0, the eigenvalues are limited to the 
discrete set of values 


dn = ~ ( at )’ n=1,2,3..., (11.63) 


b-a 


and the most succinct form for the eigenfunctions becomes 


n(x) = C, sin (“) (11.64) 


The amplitude constant C,, is arbitrary, but is usually chosen so the eigenfunction is 
normalized over the range of (2. In this particular case, we can set 


ey : n(x — a) 
n(x) = baa sin (—) . (11.65) 


According to our previous discussion, these eigenfunction should obey an orthogo- 
nality condition. Indeed this is the case, because it is easy to show that 


b 
j dibiDbky =O (11.66) 
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11.3 CYLINDRICAL SOLUTIONS 


Now that we know how to perform expansions using the eigenfunctions of Hermitian 
operators, we can proceed to solve Laplace’s equation in other coordinate systems. In 
this section, we explore the solutions to Laplace’s equation in cylindrical geometries. 
In cylindrical coordinates Laplace’s equation becomes 


a 1¢@ 1 & a 
+ = Dp, 6,z) = 0. (11.67) 


V’*@(p, 6,z) = ~ 
Plier Vad Gap gh ARR. B 


We will again use the method of separation of variables and look for solutions of the 
form 


P(p, 6,2) = R(p)H(8)Z(z). (11.68) 


Inserting this form into Laplace’s equation gives 


R(p) dp? pR(p) dp ~—p*H(0) 002 — Zz) a? 


1 #R) , 1 AR), 1 _ FHO , 1 FZ _ 4 ay 69) 


11.3.1 Solutions of the Separated Equations 


In cylindrical coordinates, things are quite a bit more complicated than they were in the 
Cartesian case, because the p and 6 dependencies are not isolated in Equation 11.69. 
In addition, unlike the Cartesian solutions, the p, 0, and z separated solutions do not 
all have the same functional form. We will tackle these different solutions one at a 
time. 


Separating the z-Dependence The z-dependence is easy to separate because it is 
entirely contained in the last term of Equation 11.69. Setting this term equal to a 
separation constant c, gives 


1 0°Z(z) 
sO 11.70 
Z(z) dz? os : : 
This has the form of an eigenvalue problem 
d°Z(z) 
= ¢,Z(2). (11.71) 
oz? 


Just like the Cartesian examples we presented earlier in this chapter, the possible 
values that c, can take on will be limited by whatever boundary conditions are 
specified for the problem. For example, perhaps P(p, 9, z) is required to be zero at 
z = aand z = b. We explored this case already in a previous example. This forces 
Z(z) to have an oscillatory form, and consequently c, must be negative and real. If 
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we set c, = —k? < 0, the form of Z(z) is 
e tik sin(kz) ia 
Z(z) > {on or Z(z)— oe C= -k <0. (11.72) 


We have already shown that this operator, when combined with the functions of 
Equation 11.72 and the proper boundary conditions, is Hermitian. Therefore, we can 
expand arbitrary functions of z using this set of solutions. 

In contrast, we might have a set of boundary conditions which requires P(p, 4, z) 
to vanish as z — +. Oscillatory solutions of Z(z) are obviously not going to work 
in this case. In fact, the only way to get a solution that meets these conditions is to 
use a positive, real separation constant so the solutions of Z(z) are exponential. With 
c,=a*>0, 


sinh(@z) 


ee: 
cosh(ae) cp = a’ > 0. (11.73) 


Zo= ae or z@ > { 


These solutions, when combined with the 9? /dz* operator, do not satisfy the Hermi- 
tian condition and consequently do not obey an orthogonality condition. This means 
it is not possible to use these functions in an expansion. 

As always, the special case where c, = 0 needs to be considered. The form of 
Z(z) is linear: 


Z(z) > . c, = 0. (11.74) 


Separating the 0-Dependence After removing the z-dependence and multiplying 
by p”, Equation 11.69 becomes 


2 92 2 
po R(p) | Pp ORC) | on L(G) _ 


RG) a * Re) ap °° %* HO) oe? an 


The 6-dependence is now isolated to the last term, which can be set equal to a second 
separation constant cg: 


1 06*H(@) 
"= &p. 11.76 
HO) 02 © eon) 
This equation is also in the form of an eigenvalue problem, identical to the one 
describing the z-dependence. Taking ce = —v? < 0 gives orthogonal, oscillating 
solutions of the form 


+iv6 


(0) {enine of HCO)» {SiN 


Frond 
eas co =—-v <0. (11.77) 
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Taking cg = 8’ > Oresults in nonorthogonal solutions of the form 


sinh(B@) 


cosh(B6) co = Bp? > 0. (11.78) 


e+ Be 
(60) > {er or H(0)— { 
The special case of cg = O gives the form 


H(0) > ie SO) (11.79) 


Most cylindrical solutions need to be periodic in 6, repeating every 277. If this were 
not the case, H(0) # H(@ + 27), and ®(p, 9, z) would not be a single-valued function 
of position. For solutions that are periodic in 0, the form given in Equation 11.77 
with vy = 1,2,3,... and the H(@) — 1 solution of Equation 11.79 are the only 
ones allowed. We can actually combine these two forms together into just the form 
of Equation 11.77 if we allow » = 0 to be included in that expression, so now 
Ca = —v? < 0. There are, however, some cases where the boundary conditions 
limit the range of @ to something less than 0 to 277. In these cases, cg can be greater 
than zero, and the other solutions of Equation 11.78 plus the H(@) — 6 solution of 
Equation 11.79 are allowed. 


Separating the p-Dependence With the z- and @-dependencies removed by insert- 
ing their separation constants, the differential equation for R(p) becomes 
2 92 
p’ a R(p) p_ OR(p) 2 
+ ————— + c,p* + ce = 0. (11.80) 

Rip) dp? Rp) ap 
The form of the solution to this equation depends on the nature of the separation 
constants. When cg <= 0, the most common case, the solutions of R(p) are either 
Bessel functions or modified Bessel functions depending on the sign of the c, constant. 
The sections that follow derive series representations for these important functions. 


11.3.2 Solutions for R(p) with c, > 0 andcg = 0 


The first situation we will consider is the case for cg = —v* = Oandc, = a? > 0. 
Notice we have included v = 0 in this category as we discussed when separating out 
the 6-dependence in the previous section. This is a solution that oscillates (except 
when v = 0) in @ and exponentially grows or decays in z. For this case, Equation 11.80 
becomes 


#R(p) , 1 AR(p) 
Op” p oO 
The procedure for solving this equation first involves scaling the p variable using 
the value of a. Notice that these two quantities have reciprocal dimensions so that 
the quantity (ap) is dimensionless. To accomplish this, rearrange Equation 11.81 into 
the form 


2 
+ a?R(p) — BRO) =0. (11.81) 


#R(p) , 1 AR(p) R(p) — v 


a(ap)? ap Aap) (apy = 0. (11.82) 
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Now define a new variable r = ap and a new function 
R(p) = R(ap). (11.83) 
With these new quantities, Equation 11.82 can be written as 


e’R(r) | 1 dR) 
+ 
or? r or 


2 
+ Rr) - SR) =0. (11.84) 


Once this equation is solved for R(r), the function R(p) is obtained using Equa- 
tion 11.83. This result shows that the a parameter acts like a scaling factor for the 
solution, because the qualitative behavior of R(p) does not depend on a. The v pa- 
rameter, however, is not just a scaling factor. The qualitative behavior of R(r), and 
thus R(p), depends on the specific value of v. 


Bessel Functions of the First and Second Kind Equation 11.84 can be solved 
using the method of Frobenius, described in the previous chapter. Recall that, with 
this method, we look for series solutions of the form 


R(r) = ee ae (11.85) 
n=0 


By plugging this into Equation 11.84, and requiring cp # 0, we get the indicial 
equation for s: 


r—-pv=0 (11.86) 
with the result 
s=tp. (11.87) 
This gives two solutions for R(r): 
Rr) = So anr™*”  froms = +p (11.88) 
n=0 
and 
Ror) = So bar" ” from s = —v. (11.89) 
n=0 


Two different sets of coefficients, a, and b,, have been used to distinguish between 
the solutions. 

When the first of these two series, Equation 11.88, is inserted back into the 
differential equation, the coefficients can be easily evaluated. The series is 


(-1)" Ger. 


R\(r) = a,2’v! § > ——_-—_( 11.90 
(7) EON) cal +) ( ) 


2 
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where a, is an undetermined coefficient. If a, is chosen to be 1/2” v!, the RHS of 
Equation 11.90 is defined as the v*-order Bessel function of the first kind: 


Jr) = > eR. Gyr". (11.91) 


= mi(m + v)! 


Unfortunately, if we try the same trick with the series in Equation 11.89, we run 
into trouble. If v is an integer, which is the most common situation, the b, coefficients 
will always diverge after some finite number of terms. Consequently, when v is an 
integer we cannot generate a second solution using this method. This divergence stems 
from a singularity of the original differential equation at r = 0. This second solution 
has a “logarithmic” singularity at r = 0, which means it cannot be represented by a 
power series. However, we can find the second solution if we modify the method of 
Frobenius slightly, by explicitly identifying the singularity with a logarithm factor. 
Now we look for a solution of the form, 


Ro(r) = In(r)J,(r) + 33 bar?*5, (11.92) 
n=0 


where J,(r) is the first solution we found in Equation 11.91. Actually, because the 
differential equation is linear, any linear combination of R\(r) = = J,(r) and the R,(r) 
obtained with this method is an equally legitimate second solution. This gives us 
some flexibility in choosing the second solution. A common choice is called the 
Bessel function of the second kind, or sometimes the Neumann function of order v. 
Deriving the result is straightforward for the v = Ocase. The general case for arbitrary 
v, however, is quite complicated. We will simply give the result here and recommend 
you investigate the classic reference by Watson, A Treatise on the Theory of Bessel 
Functions, for the details. The second solution is 


Y,(r) = = 1) (in 5 Fs 7) 


1526 1)" ag + Any) Gyr" 


ar ani(n+ v)! 
pol 
(v—n—1)! fry\2n-¥ 
5 11.93 
2 a! G) ( ) 


where h,, is the partial harmonic series defined by 


Ges en rere nee 
—_ boa a Br n>O- (11.94) 
0 n=0 
and ¥ is 
y = lim(/, — Inn) © .5772, (11.95) 


which is called the Euler constant. 
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Y,@ 


Bor Bir Bor Biz Bos Bis 
Figure 11.11 Bessel Functions of the First and Second Kind 


Because these functions are so complicated, they have been extensively tabulated 
in reference books, such as the Handbook of Mathematical Functions by Abramowitz 
and Stegun. The vy = O and v = 1 versions of these functions are plotted in 
Figure 11.11. All the J,(r) functions are zero at the origin, except for Jo, which 
has Jo(0) = 1. The functions all have the form of a damped oscillation, and the 
points where they cross zero turn out to be very important. These points are not 
evenly spaced, and we usually define the n’* zero crossing of J,(r) as the special 
constant a,,. These values are also extensively tabulated in the literature. The Y,(7) 
functions all go to minus infinity as r —+ 0. Away from the origin, these functions 
are again in the form of damped oscillations. The n™ zero crossing of Y,(r) occurs 
atr = Bin. 

After all this, it must be remembered that we originally were seeking the general 
solution of R(p) in Equation 11.81. We have obtained two independent solutions for 
R(r) in Equation 11.84. These solutions can be related by to R(p) via Equation 11.83. 
Therefore, the general solution for R(p) is any linear combination of the two solutions: 


J,(ap) 
R(p) > ae ; (11.96) 
The General Solution for ®(p,0@,z) The general solution for P(p, 6, z), for the 
case where c, = a? > O and cg = —v* S 0, is a linear sum of all the possible 
products of the separated functions, 
= cos(v@) fcosh(az) | J,(ap) 
P(p, 0,2) = pe 2 yea ee io , ln) 


where the summation is over all the allowed values of v and a. In general, the 
boundary conditions will limit these possible values, as you will see in the example 
that follows. 

Keep in mind, we could have just as easily chosen to write the @-dependence 
using complex exponentials, but that is not as convenient when we expect H(6) 
to be real. Likewise, the z-dependence has been written with the sinh and cosh 


CYLINDRICAL SOLUTIONS 447 


functions, but could just as easily have been expressed with growing or decaying 
exponential functions. Computational convenience dictates which choice makes more 
sense. Notice the 6-dependence is periodic, while the z-dependence is exponential. 
This confirms the general rule for separated solutions we declared at the beginning of 
the chapter. If a solution oscillates in one direction, it must exponentiate in another. 


Applying the Boundary Conditions Consider the boundary conditions described 
in Figure 11.12. The potential is controlled on the surface of a cylinder of radius r, 
and length 2z,. The sides of the cylinder are held at ®(r,, 6,z) = 0. The top is held 
at P(p, 9,z.) = fr(p, 9), a potential that varies with both p and 6, while the bottom 
is held at a different potential, ®(p, 0, —z,) = fp(p, 9). The problem is to determine 
the potential inside the cylinder. 

It seems appropriate to start with the general solution given by Equation 11.97, 
because the range of @ is 0 < @ < 27, which requires the solution to be periodic 
in @. Also, given the potentials on the top and bottom, the solution does not appear 
to be periodic in z. 

The geometry, boundary conditions, and a little physical reasoning allow the 
general form to be simplified somewhat. First, it would not make any physical sense 
for the potential to diverge as p — 0, so the R(p) solution will not have any of the 
Y,(ap) solutions. Next, the boundary condition which forces P(r,, 8,z) = 0 limits 
the possible values of a to a discrete set. Remember, the zeros of a particular J,(ap) 
Bessel function occur when the argument is equal to one of the constants a,,. So to 
have R(r,) = 0, we must have 


Oro = Qn, (11.98) 


where v = 0,1,2,... andn = 1,2,3,.... In this way, the summation over a in 
Equation 11.97 is converted to a summation over n, and the general solution is 


@(p, 8, +z,) = f,(p,8) 


+Z, 


P(r, 8, z)=0 


@(p, 8, —z,) = f,(p,9) 


Figure 11.12 Cylindrical Boundary Conditions Nonperiodic in the z-Direction 
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reduced to 


_< . fcos(v@) {cosh(@m2/To) 
940.0. = OY {Sn} ee Tes {Jslamp/r) (11.99) 


Notice that the value of a@ in the argument of the hyperbolic functions is determined 
by a boundary condition associated with the Bessel function. 

If the shorthand notation on the RHS of Equation 11.99 is expanded, it is nec- 
essary to introduce four amplitude constants. In order to determine these constants, 
the top and bottom boundary conditions, yet unused, need to be introduced. These 
amplitude constants can be determined much like we found the amplitude constants 
in the Cartesian problems at the beginning of the chapter. By applying orthogonality 
conditions, we can isolate and evaluate one of the constants associated with a partic- 
ular » and a particular n. This will require two different orthogonality relations, one 
to isolate values of v, and one to isolate values of n. 


Orthogonality Relations We already know that the sin and cos functions which 
describe the 6-dependence obey an orthogonality condition, while the sinh and cosh 
functions do not. To evaluate all the amplitude constants, we need an additional 
orthogonality relation, which we can obtain from the Bessel functions. We can derive 
it by putting Bessel’s equation in the form of an eigenvalue, eigenfunction problem, 
identify a Hermitian operator, and then determine the orthogonality properties of its 
eigenfunctions. 

Originally, the J,(a,,p/ro) Bessel functions were constructed to satisfy Equa- 
tion 11.81: 


a 1l@ yr a2 
5. ais a) Jy(Qmp/To) = — FI (Qimp/To)- (11.100) 
op* pop p rs 


This is in the form of a standard eigenvalue, eigenfunction problem with the weighting 
function w(x) = 1, 


Lop@n = AnPns (11.101) 
where 
Fe lo wv 
Lop = -—--—+s 11.102 
‘? dp? pdp- pp : ) 
on = J(amp/To), (11.103) 
and 
2 
Was, (11.104) 
> 


Because v is inside the operator and fixed, different eigenfunctions of this operator 
are obtained by incrementing n, i.e.,m = 1,2,3,.... 
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These functions will obey an orthogonality condition if, when they are combined 
with £,,, they satisfy the Hermitian condition, that is, if 


[a J, (QmP/ 1.) LopJv(Gmp/To) 


equals 


[ dp J(amp/To)LopIv(amP/To)- 


Unfortunately, it turns out that these two integrals are not equal and the Bessel 
functions, by themselves, are not orthogonal. 

We know, from our proof in the previous section, that an operator in Sturm- 
Liouville form, combined with the appropriate boundary conditions, will automati- 
cally lead to the Hermitian condition. The operator in Equation 11.102 is not, itself, 
in Sturm-Liouville form, but if Equation 11.100 is multiplied by p it becomes 


a oO vy a2 p 
—+—- — - im 
> are =| mp a Sodom / To) (11.105) 


which can be written as 


F F) 2 2 
=p ~~ | I anp/ro) = — =P I,(amp/ro)- (11.106) 
dp op p rs 
The modified operator for this equation is 
o8 2 
ap (11.107) 
op opp 


which is in Sturm-Liouville form. Equation 11.106 is in the form of a eigenfunction 
equation, with a weighting function of w(p) = p: 


Lipdn = Anpon- (11.108) 


Here again, the eigenfunctions and eigenvalues are 


on = Jy(Qinp/To) (11.109) 
2 
dy = SE. (11.110) 


Because the operator £7, is in Sturm-Liouville form, the Hermitian condition will 
be satisfied if Equation 11.57 is satisfied: 


= 0. (1.4111) 


“empl 


J, (Qnp/To)p qi 


2 
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Because the range of interest for this problem has an obvious lower limit of p = 0 and 
an upper limit of p = ro, this condition is easily satisfied. The lower limit vanishes 
due to the presence of the p term, and the upper limit vanishes because J,,(a,,) is by 
definition zero. Thus with © covering the range 0 < p < r,, the modified operator 
of Equation 11.107 is indeed Hermitian. This means the Bessel functions form a 
complete set of functions and obey the orthogonality condition of Equation 11.49: 


fi dp Jy(cmp/To)pF(amp/te) =O  n#m. (11.112) 
0 


In order to perform expansions, we need one more tool. We need to also know the 
value of the integral of Equation 11.112 when m = n. We give the result, without the 
proof, that the general expression is 


To r2 
j dp J) (Qmp/To) pI v(Cump/To) = Sum > Sot (an)s (11.113) 
oO 
where we have used the Kronecker delta symbol, Sym. 
The orthogonality condition of Equation 11.113 now allows any function f(p), 
which obeys f(p = r,) = 0 and f(p) < ©, to be expanded in the range 0 = p =r, 
using the Bessel functions. In other words, we can write 


f(p) = > AnJy(cunp/To)s (11.114) 
n=1 
with 
_ Jo’ ap f(p)pIv(omp/To) 


An 
(72 /2)J2., (aan) 


(11.115) 


Evaluating the Amplitude Constants Now we can pick up the cylindrical problem 
at the point of Equation 11.99. The problem is further simplified if we impose odd 
symmetry in z by setting fr(p, 0) = — fa(p, 0). This eliminates the hyperbolic cosine 
functions and Equation 11.99 becomes 


Dp, 6,2) = 5 > ene {sintainz/r2)Jo(amp/ro) (11.116) 
v=0 n=1 


sin(v@) 


At this point, it is convenient to insert the amplitude constants and explicitly write 
out the summations of Equation 11.116: 


“Sf Aim Sin(v6) sinh(Onz/To)Iv(Gmp/To) 
®D = : : 11. 
(9, 8,2) d, d ae cos(v) sinh(anZ/7>)J(OmP/To) ve 
Now we need to determine the A,, and B,,, coefficients using the boundary con- 
ditions on the top and bottom surfaces. Because the odd symmetry in z has already 
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been imposed, we can use either the top or bottom boundary. Picking the top, the 
series expression is evaluated at z = +2): 


_ S757 f Am sin(v6) sinh(enz0/r0)Jy(amp/te) 
Sri0.8)= SS saccular nual) } ee 


v=0 n=1 


The amplitude constants can now be isolated and evaluated by using the orthogonality 
conditions of the trigonometric functions and of the Bessel functions. Because there 
are two summations, one over 7 and one over v, two orthogonality conditions are 
required. 

To evaluate the A,,, operate on both sides of Equation 11.118 with 


20 
| d@ sin(v, 8). (11.119) 


This filters out all but the vy = vy, sine terms in Equation 11.118: 


Qa ie 
| d@ sin(v,9)fr(p, 8) = > Ayjntsinh(y,n2/To)Jv,(y,nP/To)- (11.120) 
oO 


n=l 


Notice how the order of the Bessel function vy, has been selected, not by the orthog- 
onality of the Bessel functions, but by the orthogonality of the sine functions. 

A particular value of n is selected by operating on both sides of Equation 11.120 
with 


[ dp pJy,(Qy,n,P/To)- (11.121) 
0 
This singles out the n, term to give 
lo 27 
[a etabetuno/re) 40 sin(v8) flo, ®) 
0 o 


2 
= Ayn, 7 sinh (@tsyn,Z0/o)-2 J +(Qyn,) (11.122) 
so that, in general, 


Je? dp pI. Aamp/ro) {.” dO sin(v®) fr(p, 8) 


Ayn = : 11.123 
7 (ar2/2) sinh(Q@ynZo/To)J?, ;(Qyn) : ; 
A Similar operation determines the B,, 
°° dp pJy(Qmp/To) [-” 40 cos(v6) fr(p, 8 
B,, = Jo’ 40 pI (@inp/To) J cos(v6)fr(o,8) Lg (11.124) 


(mr2/2) sinh(QinZo/To)Jv+1(Qn) 
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and 


(mr3) sinh(anZo/To)J?(aon) 


These amplitude constants are then inserted into Equation 11.117 to complete the 
solution. 

A simpler result is obtained if it is assumed that the potential on the top and bottom 
is not a function of 6, 1.e., fr(p, 9) — fi(p). Ina case like this, only v = 0 is allowed. 
There are no Ao, terms because sin(0) = 0. Only Bo, amplitudes are present, and the 
solution for P(p, z) inside the cylinder becomes 


Bon =0. (11.125) 


D(p, z) = > Bon sinh(a9,2/1o)Jo(QonP/To)s a 1.126) 
n=1 
with 
So’ dp pJo(GonP/To) fr (p) 
Bon = 73 )\aahiane Jp Ton)’ 11.127 
2/2) sinbh(o1onZ0/ro)J? (Gon) (11.127) 


11.3.3 Solutions for R(p) with c, < 0 andcg = 0 


We now turn our attention to cylindrical solutions of Laplace’s equation that oscillate 
in both the z- and 9-directions. The general solution is still a product of the separated 
solutions 


®(p, 0,2) = R(p)H(0)Z(z), (11.128) 


and the functions Z(z) and H(6@) still must satisfy Equations 11.70 and 11.76, respec- 
tively. Now, however, c, = —k* < Oand cg = —v? =< 0. This means that the form 
of solutions for these equations will be 


1kz 
Z(2) + hae or Z(2) 4 eens (11.129) 
and 
ivé : 
H(@) > { i or H(@)—> ee (11.130) 


The choice between the trigonometric and exponential forms of the solutions is, as 
usual, determined by whichever is more convenient for a particular set of boundary 
conditions. 
The separated equation for R(p) becomes 
2 2 
E + 12-8 Ela =o (11.131) 
op? pap p* 
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Equation 11.131 is identical to Equation 11.81 with a? — ~—k? or a — ik. The 
solutions to Equation 11.81 were found to be Bessel functions of the first and second 
kind: 


Ji(ap) and = Y,(ap). 


The solutions to Equation 11.131 can therefore be written as 


J,(ikp) 
R(p) > ae (11.132) 


The imaginary arguments in the Bessel functions have a similar effect as imaginary 
arguments in the sine and cosine functions. These functions no longer oscillate, but 
they grow or decay with p. We might have expected this, because the general rule for 
the separated solutions of Laplace’s equation in Cartesian systems was that at least 
one separated solution was growing or decaying. 

The functions in Equation 11.132 are not in the most convenient form, how- 
ever, because they are not pure real functions of p. We can correct this problem by 
constructing two different solutions which are pure real: 


L,(kp) = i-” J, (ikp) (11.133) 
K,(kp) = (a/2)i"* [J (ikp) + i¥,(ikp)). (11.134) 
These two functions are called modified Bessel functions of the first and second kind. 
Again, like the Bessel functions, the modified Bessel functions are well tabulated 
in the literature. Graphs of the first few, Jo(r), [i(r), Ko(r) and K,(r), are shown in 
Figure 11.13. Notice, the /,(r) functions are finite at r = 0 and blow up as r — =. 


In contrast, the K,(r) functions blow up as r — 0 and go to zero as r — ©, 
Now, we can write the general solution of R(p) as 


I,(kp) 
R(p) > ee ; (11.135) 


and the general form of ®(p, 6, z) for this case becomes 


ot cos(v0) |{cos(kz) { I,(kp) 
RepeGizy= dd. ee een ee (11.136) 
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In this expression, sine and cosine functions have been used for the z- and 0- depen- 
dencies. As always, complex exponential functions could have been used instead, but 
the sine-cosine functions are better when we expect the final answer to be pure real. 
Again notice the presence of at least one oscillating factor and one growing/decaying 
factor in each term. 


Example 11.4 As an example of an electrostatic problem that uses the type of 
solution given in Equation 11.136, consider the geometry shown in Figure 11.14. 
This is quite similar to the problem discussed in Figure 11.7, which described a 
periodic boundary condition in Cartesian coordinates. In this problem, the potential 
is controlled on the p = r, surface of the cylinder. Its value is +V, for0 < z< L/2 
and —V, for L/2 < z < L. We are interested in a solution for ®(r,, 6, z) inside the 
cylinder where p < ro. 

Notice that the boundary condition is periodic, so P(r,, 0,z) = P(r,, 0,z + L) 
for all z. Consequently, the solution for this problem must also be periodic in z, and 
So the solution to Laplace’s equation for p < r, must fall into the general form of 
Equation 11.136. The symmetry of these particular boundary conditions allows us 
to simplify this general form tremendously. First of all, there is no @-variation, so 
we can set H(@) = 1, and the sum over v only uses the v = 0 term. Second, the 
solution must be finite for p = 0, so there are no Ko(kp) terms, because they diverge 
as p —» 0. Next, because the boundary conditions have odd symmetry about z, all the 
cos(kz) terms vanish, because they are all even functions. Finally, the periodicity of 
the boundary conditions forces k to be limited to the discrete set of values: 


2 
k > ky = where n = 0,1,2,.... (11.137) 


With these simplifications, Equation 11.136 reduces to 


B(p, 8,2) = > Ay Sin(knz)lo(knp) (11.138) 


n=1 


_ . { 2amz 27mp 
= 2s sin (7) Io (77) ; (11.139) 


Notice the value of k appears not only as an argument of the sin(kz) term, but also in 
the argument of the modified Bessel function. 

As always, the A, coefficients can be determined using an orthogonality condition 
and the boundary conditions at p = rz. Operate on both sides of Equation 11.138 by 


L 
. { 2mmz 
fw sn ( L ) 


TE, 
, ae sia (7) O¢,2)= antl (=) (11.140) 
0 


to obtain 
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Figure 11.14 Cylindrical Laplace Problem Periodic in the z-Direction 


Now, if we require 


+V, 0<z<L/2 
—V, L/2<z<L 


and perform the integral on the LHS of Equation 11.140, the A, amplitudes evaluate 
to 


P(7r,,Z) = { (11.141) 
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0 n even 
An = AV, (11.142) 
——_..—_-.~ dd 
ntIo(2amr,/L) eee 
The final solution to this problem for p < r, becomes 
= 4V, . [{210z 27mnp 
PD(p, 6,z) = —— }. 11.143 
(O2= Do naly(2amr,/L) ( L ) 4 ( L ) Che) 


n=1,3,5,... 


In this example, the modified Bessel functions did not combine with the p- 
dependent operator to satisfy a Hermitian condition. Consequently, they were not 
orthogonal functions and could not be used to select a particular value of n. Instead, 
we used the orthogonality of the sine functions to perform the expansion. 


Example 11.5 Asa final example, consider the cylindrical boundary value problem 
shown in Figure 11.15. The potential is held at zero on the top and bottom and at 
+V, on the sides. The solution to Laplace’s equation is sought for p < r, and 
0<z<L/2. 

The first thing to decide is the form of the general solution. Is it going to be 
Equation 11.97, with oscillating solutions in p and nonoscillating solutions in z, or 
Equation 11.136, with oscillating solutions in z and nonoscillating solutions in p? 
The nonoscillating solutions in z clearly cannot work, because if the amplitudes are 
adjusted to make ® = 0 at z = 0, this solution cannot also be zero at z = L/2. 
Therefore, we must choose the form of Equation 11.136. 

Now that we have determined that the solutions must be periodic in z, what is 
the spatial period? Remember we tackled a similar problein in Cartesian geometry, 
where we successfully converted a nonperiodic problem into a periodic one. Let’s 
try the same process here. We can construct an infinite cylinder, of which the region 
shown in Figure 11.15 is but a single “cell.” The zero potentials at the top and bottom 


| L/2 


Figure 11.15 Boundary Conditions for a Cylindrical Can 
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Figure 11.16 The Cylindrical Can Converted to a Periodic Problem 


of this cell must be established by the periodic potential applied to the p = r, surface 
above and below this cell. This situation is shown in Figure 11.16. 

From Figure 11.16 it is clear that the solution inside the can will be identical to 
Equation 11.143, the solution obtained in the previous example. Of course, in this 
case the solution is only valid for 0 < z < L/2. 
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11.4 SPHERICAL SOLUTIONS 


The solutions to Laplace’s equation in spherical geometry are fairly complicated, but 
they are important to understand because of their frequent use in quantum mechanics. 
The organization of the periodic table of elements is due primarily to the nature of 
these types of solutions. We will take the same approach we did with the other 
coordinate systems and look for separable solutions. 

Recall, the spherical coordinates (7, 0, @) of a point P are defined as shown in 
Figure 11.17. The coordinates are limited to the rangesO <r < +~,0< @0< 7, 
and 0 < ¢@ < 27. The Laplace operator in this system becomes 


1 3 1 @ a 1 3 
Y= Sn 0 . 
rare’ * 72sin@ 00 no 99 * Fe sin® 0 ab? Chea) 
Again, we seek solution to 
V(r, 0,6) = 0 (11.145) 


by the method of separation of variables. However, because of the r-factor in the first 
term of Equation 11.144, it turns out to be more convenient to include a 1 /r factor 
in the separation process. That is, we will look for a solution with the form: 


P(r, 6,6) = AO POF). (11.146) 


Substituting this into Laplace’s equation and multiplying by 


r? sin? 6 
R(r)P(0)F(p)’ 


we obtain 


r*sin? 9 0*R(r) | sin@ d . _ aP(@) a 1 oF(p) _ 


RG) at * Pe 8! 99 t Rg) Garo (11.147) 


Figure 11.17 The Spherical Coordinates 
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Separating the @-Dependence Because the last term on the LHS of Equation 11.147 
contains all the @-dependence, it can be set equal to a separation constant. In the most 
common spherical problems, the range of ¢ is 0 < ¢@ < 27, and to keep the po- 
tential single-valued, we must have P(r, 0,6) = ®(r, 0, + 277). This means the 
F(¢) solutions must be periodic, and thus the separation constant cannot be positive. 
We will stick with the convention used by most scientists and define this separation 
constant as 


2 
gh ENO) —m <0. (11.148) 


F(p) ag? 


We have discussed the solutions to this equation already. F(@) can take either of two 
forms: 


+imd ‘ 
Fb) — {Sint or Fi) — {Sm (11.149) 


In electrostatics, where the solutions need to be pure real, the second form of 
Equation 11.149 is usually preferred. In quantum mechanics, the solutions can be 
complex, and the first form is frequently used. Regardless of the form, the periodicity 
over the range 0 < @ < 277 forces m to take on the discrete values 


m = 0,1,2,.... (11.150) 


Separating the r-Dependence Now we can proceed to separate the r-dependence. 
Substituting —m? for the @-dependent term, Equation 11.147 reduces to 


r?sin? 0 0°R(r) sind @ oP(0) 2 
+ SO* = sing ~ m? = 0. 11.151 
Rr) or? | P6008 Cone 


After dividing this entire equation by sin? @, the r-dependence is confined to the first 
term, which can be set equal to a second separation constant. For reasons that will 
become clear later, we will give this constant a rather peculiar form: 


r? @?R(r) 
R(r) ar? 


=2£(€ + 1). (11.152) 


What is the general solution for R(r)? One could go through the entire method 
of Frobenius to generate series solutions for this equation, but in this case, it is not 
difficult to see that the series will collapse to a single term. If we look for a solution 
with the form 


R(r) xr", (11.153) 
Equation 11.152 is satisfied if n satisfies the algebraic expression 


(n)\(n — 1) = £(@ + 1). (11.154) 
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This implies two solutions for n, 


n=£+1 (11.155) 
and 
n= —-£., (11.156) 
The solutions for R(r) then are 
poe 
R(r) > (rest F (11.157) 


This can easily be confirmed by substituting these solutions back into Equation 11.152. 
At this point, we cannot say too much about these solutions, because we do not know 
if € is real, imaginary, or complex. To determine the nature of €, the @-dependent 
equation must first be solved. 


Separating the 0-Dependence Substituting €(€ + 1) for the r-dependent term in 
Equation 11.151 gives 


19 5 gdP(@) __w 
P(@) sin 6 00 00 sin? 0 
This equation is called the associated Legendre equation and occurs often in quantum 
mechanics. 
The trigonometric functions in Equation 11.158 are difficult to manipulate, but we 
can simplify things quite a bit by introducing the variable change 


x = cos @. (11.159) 


Because 0 = @ = 7, the range for x is —1 = x = +1. The extreme values must 
be included in the respective ranges to include the z-axis. This turns out to be a very 
important fact when we begin to solve this equation. With this change of variables, 


+2 +1)=0. - (11.158) 


dx = — sin @ d@ and sin? @ = 1 — x”, and Equation 11.158 becomes 
2 
PN Gy ty a Es pe Ah BG) =. (11.160) 
ox Ox 1 — x2 


This new function P(x) is related to our original function P(@) by P(@) = P(cos 6). 
Notice Equation 11.160 is in Sturm-Liouville form. We can maneuver it to be in the 
form of an eigenvalue/eigenfunction equation as well: 


2 

\z (a - rd — te = —£(€ + 1)P(x). (11.161) 
ox Ox 1- x? 

Based on our previous experiences with equations in this form, we know the solutions 

will form a complete, orthogonal set. 

Because of its quantum mechanical significance, we will present a fairly detailed 
solution to Equation 11.161. We will begin by looking for solutions that have no 
o-dependence, i.e., solutions with m = 0 that are symmetric around the z-axis. Then 
we will tackle the general case. 
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11.4.1 Axially Symmetric Solutions with m = 0 
With m = 0, the @-dependence becomes 


F(o) > 1, (11.162) 


the r-dependence remains the same 


£ 
ae) = ee . (11.163) 
and the equation for P(x) becomes 
oP 
z ia - ead = —&(£ + 1)P(x). (11.164) 
Ox Ox 


At this point, it is clear that the solution for P(x) depends upon the choice of €, 
so let’s start labeling the functions with an € index: P(x) 3 P,(x). The solution to 
Equation 11.164 can be obtained using the method of Frobenius by setting 


Pe(x) = So agx"**. (11.165) 
With this substitution, 
we) = aint + s)x"t5~ 1 (11.166) 


and Equation 11.164 becomes 


90 


>, ‘é [an(n + s)x"*8"! — ant syne] + (64+ baa} =. (11.167) 


n=0 
Performing the second differentiation gives 


oe 


> [an(n + s\n ts — Ix"? — a(n + s)(n + 5 + Wx"? 
n=0 


+ €(€ + l)a,x"**| = 0. (11.168) 


The next step is to identify the coefficients of every power of x, and set them equal 
to zero: 


ag(ss — 1x? + ay + s)(s)x7! + S ann + s\(n+s-—1) (11.169) 
n=2 


—a,-2[n+s5—-2)n+s-1)-#e+ L)]}x"*5-2 -0. 
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The coefficients for the x°~* and the x°~! terms are individually set equal to zero to 
form the indicial equations: for s 


ao(s)\(s — 1) =O (11.170) 
a,(s + 1)(s) = 0. (11.171) 


Taking ag # 0, Equation 11.170 gives s = 0 and s = 1. These two values of s 
generate two solutions for Pe(x): 


Ae(x) = So anx"* from s = 1 (11.172) 
n=0 
and 
Bex) = So bnx" from s = 0. (11.173) 
=0 


Two different symbols, a, and b, have been used for the coefficients to distinguish 
between the two different solutions. 

The solution Po(x) = Ae(x) is generated with s = 1, ag # 0, and a, = 0. The 
solution Pe(x) = Be(x) is generated with s = 0, bp # 0, and b, = 0. It is worth 
mentioning that there are other possible choices. When s = 1, Equation 11.171 
requires a; = 0, but with s = 0, Equation 11.171 can be satisfied with a nonzero 5. 
So an alternate form of Bg(x) could be generated with both bp and b; nonzero. This 
new solution, however, would be a just a linear combination of our original Be (x) 
and A(x). Also, the indicial equations could be satisfied with ag = bo = 0, and both 
a, and b, nonzero. In this case, Equation 11.171 would require different values for 
S, namely s = —1 and s = 0, but we would generate exactly the same solutions as 
our original choice. This makes sense, because the form of the series expansion in 
Equation 11.165 implicitly assumes that ag is the lowest nonzero term. 

The last term in Equation 11.169 provides generating equations for a, and b, for 
n=2: 


= (nt+s—2)n+s—-1)-€€4+D) 

Qn = Qn-2 ESC ET Ea o (11.174) 
_ (n+s—2)n+s-1-LE4+1) 

bn ial b,-2 (n a s\n +5 —1) ——— = (11.175) 


We will start off both series with ap = 1 and by = 1. 
These series have very interesting convergence properties. For n > 1, a, = dn-2 
and b, = b,—2, so the ends of both series look like 


Ag(x) = +++ + Gap (x + x"F? + x4 + ++} (11.176) 
and 


Bo(x) = +++ + Dyoy (x7 $x + xt ed, (11.177) 
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Both these solutions must be valid for 0 = @ = a, or equivalently 1 = x = —I. 
From the ratio test on Equations 11.176 and 11.177, it is clear that the only way for 
these series to converge at the end points is to have them terminate, i.e., both a, and 
b, must be zero for some, not necessarily the same, value of n. So we must look for 
the conditions that allow these generating functions to start with nonzero do (or bg), 
but which eventually cause the series terminate. 

Starting with the a,, Equation 11.174 simplifies to 


(n — 1)(n) — £2 + 1) 


Qn = Qn— , 11.178 

n n-2 (n + 1\(n) ( ) 

Because a, = 0, this equation is evaluated for only even values of n, ie., n = 
2,4,6,.... Now, if a, is to be zero for some even value of n, the numerator must go 


to zero. That is to say, for some even value of n, 
(n— 1)(n) — 22 + 1) = 0. (411.179) 


This will cause a, and all subsequent a>, to be zero. Similar arguments conclude 
that the series representation for Bg(x) terminates at an even value of n given by 


(n— 2)(n— 1)-€(€ + 1) =0. (11.180) 


Because n is a positive, even integer, the only way either Equation 11.179 or 11.180 
can be satisfied is if € is a real, but not necessarily positive, integer. This is our first 
condition on the separation constant €(€ + 1) that isolated the r-dependence. Now 
we need to determine what these terminating solutions for P(x) look like. We will 
individually determine different solutions as € is indexed, ie., € = 0,+1,+2,.... 

Starting with € = 0, the A(x) series terminates when Equation 11.179 is satisfied, 
i.e., when 


(n-—1)(n)=0  forn = 2,4,6,.... (11.181) 


It is clear that no positive, even value of n will satisfy this condition. Thus, when 
£ = 0 the A(x) series of Equation 11.172 does not converge for x = +1, and is 
therefore unacceptable if the solution must be valid on the z-axis (where x = +1). 
With € = 0 the Bo(x) series terminates when Equation 11.180 is satisfied, i.e., when 


(n-2)n-1)=0 forn = 2,4,6,.... (11.182) 


This equation is satisfied with n = 2. So with € = 0, the Bo(x) series of Equa- 
tion 11.173 terminates and therefore converges for x = +1 with b, = 1, b,; = 0, and 
by = 0. Since the Ao(x) solution was unacceptable, the only terminating solution for 
£ = Vis 


Po(x) = Bo(x) = 1. (11.183) 


We will now work out the solution for € = 1 and a pattern for the solutions will 
become clear. Referring to Equation 11.179, the A;(x) series terminates when 


(n—-1)(n)—2=0  forn = 2,4,6,.... (11.184) 
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The 8,(x) series terminates when 
(n-2)(n—-1)-2=0 forn = 2,4,6,.... (11.185) 


As before, n must be a positive, even integer. Equation 11.185 can never be satisfied 
for any of the acceptable values of n, and so the series for B,(x) never converges for 
x = +1. Equation 11.184, however, is satisfied with n = 2. So for 2 = 1, the only 
series that converges for x = +1 is the one for Ay(x) and we have 


P(x) = A(x) = x. (11.186) 


This pattern will continue as @ is increased. The terminating solutions alternate 
between the Ag(x) series and the Be (x) series. The terminating solution is always in 
the form of a polynomial in x, with the order of the polynomial increasing with @. 

What about negative values of £? It turns out that negative, integer values of 
do not generate new solutions. For example, the € = —1 solution is identical to the 
£ = 0 solution, and the € = -2 solution is the same as the € = 1 solution. This is 
not surprising because € appears as the quantity €(€ + 1) in the differential equation. 
Setting 2 = —2 produces the same value of £(€ + 1) as does 2 = 1. So to cover all 
the possible terminating solutions, we only need consider integer values of £ greater 
than or equal to zero. 

These terminating solutions are referred to as the Legendre polynomials. The first 
six are 


Po(x) = 1 
Py(x) = x 


P(x) = (1/2)3x? — 1) 
P3(x) = (1/2)(5x? ~ 3x) 
P(x) = (1/8)(35x* — 30x? + 3). 


Each of these solutions has been normalized so that Pg(1) = 1. There is a clever 
equation, called Rodrigues’s formula, for generating these polynomials: 


Cp? an yk 
Poj= 5 


se ae (11.187) 


The general solution to Laplace’s equation with m = 0 is a linear combination of 
the Pe(cos 8) and the r-dependent solutions found in Equation 11.163: 


oo 


£ 
&(r, 6) = >a ee {Pelcos 6). (11.188) 


£=0 


Notice, by the arguments given above, only integer values of € greater than or equal 
to zero appear in the sum. 
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11.4.2 Orthogonality of the Legendre Polynomials 


From our experience with general solutions to Laplace’s equation in Cartesian and 
cylindrical geometries, it is clear that, in order to satisfy arbitrary boundary conditions 
using an expansion, some of the functions in the expansion must exhibit orthogonality 
properties. If there are two sums in the general solution, two of the expansion functions 
must have orthogonality properties. If there is only one summation then only one of 
the functions needs to have orthogonality properties. In Equation 11.188 there is only 
one summation and so either the r-dependence functions or the Legendre polynomials 
must obey an orthogonality condition. It isn’t difficult to see that the r-dependent 
functions will not be able to do this. So we must investigate the Legendre polynomials 
to determine their orthogonality properties. 

Remember the Legendre polynomials were the solutions to the differential equa- 
tion 


a]. a}, X 
= la a 3 | Pe(x) = —£(€ + 1)Pe(x). (11.189) 
Ox Ox 
The linear operator associated with this equation is 
F) 2 8 
= ~ xX}, - 11.190 
2 [u-g] a9 


which is in Sturm-Liouville form. We learned earlier that an operator of this type is 
Hermitian if Equation 11.57 is obeyed. That equation, for this case with p(x) = 1 — x’, 
becomes 


BEN) gy (11.191) 
dx 


Po(x)(1 ~ x?) 


Xmin 


where £ and £’ are two different positive integer values. We defined Xmgx and Xin 
to be the upper and lower limits of the problem. Generally, if the problem includes 
the z-axis, then the range of 0 is 0 = 6 = 7, and thus xmj, = —1 and Xmax = I. 
Consequently, the zeroing of the LHS of Equation 11.191 is accomplished automat- 
ically through the (1 -- x”) term. With this definition for 0 and using the Legendre 
polynomials, the operator of Equation 11.190 is Hermitian. 

Equation 11.189 is also in the form of an eigenvalue equation. The eigenvalues are 
—(€)(€ + 1) and the eigenfunctions are the P(x). Since we have already established 
the Hermitian property, we can immediately write down the orthogonality relation 


+t 
i, ax Pe(x)Pe(x)=0 L#24'. (11.192) 
= 


If we evaluate the above integral when € = €’, the orthogonality condition for these 
polynomials becomes 


+1 
. ‘ 2 
3 dx Pe(x)Pe(x) = 26] Oe (11.193) 
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In terms of 8: 


bs Ag is RN 2 
[ d6@ sin 6 Pe(cos 0)Pe:(cos 6) = ye he (11.194) 


11.4.3 Solutions Valid off the z-Axis 


If a spherical problem excludes the z-axis from consideration, the requirement that the 
solutions be finite at 9 = 0 and @ = am (x = +1) can be relaxed. A set of boundary 
conditions which leads to this type of solution is shown in Figure 11.18. These 
boundary conditions exist on a conical surface located at 6 = 0, and 6 = (7 — 6,). 
The potential is held at + V, on the top part of the cone, and —V, on the bottom. A 
solution for the potential in the region 0, < 6 < (a — 9,) is desired. 

The general solution to Laplace’s equation for this type of problem develops along 
the lines of the previous section. if there is no @-variation, the separable solution can 
be written as 


R(r) 


where R(r) is the same function we developed previously, 


£ 
me) a {en ; (11.196) 
and Q(@) is a new solution to Equation 11.158 with m = 0, which does not need to 
be finite on the z-axis. We use the function Q(@), because we want to reserve the P(0) 
notation for the Legendre polynomial solutions we derived in the previous section. 
The Q(6) solutions, of course, depend on the value of € and so we will, as before, 
use its value as an index: Q(8@) — Q¢(@). Again we will make the substitution 


Figure 11.18 Spherical Boundary Conditions that Exclude the z-Axis 


SPHERICAL SOLUTIONS 467 


x = cos @ and Qp(cos hd Mee Qe(8). The method of Frobenius can be used to develop 
the two solutions for Qe(x), and they will be identical in form to Equations 11.172 
and 11.173. The difference now, however, is that we must include the series solutions 
that do not terminate, the very ones we removed from the Pe(@) solutions because 
they did not converge at x = +1. 

For € = 0, we will define 


Qo(x) = Ao(x), (11.197) 


where Ao(x) is the Frobenius series of Equation 11.172 which we previously found 
unacceptable: 


Oo(x) = x + (1/3)x? + (1/5)x° + +e. (11.198) 


With some work this series can be expressed in closed form as 


> ] 1+ 
Oo(x) = = In es (11.199) 
2 1-x 
Similarly for € = 1, define Q)(x) as the nonterminating series, 
O1(x) = By(x), (11.200) 


which in closed form can be written as 


. + 
O1(x) = 5 (; =) 4 (11.201) 
2 1-x 
For € = 2: 
. 3x? - 1 lt+x 3x 
= a 11.202 
Ona) = = 5 n(; “*) 3 (11.202) 


There is a generating function for the Qe(x) functions that is rather complicated. It 
involves the Legendre polynomials 


1+ 2-1, 2é —5 
Oe(x) = praia (| *) — SP) — Gye. (11.203) 


Notice how these solutions diverge for x = +1. 

Keep in mind, for the Q¢(@) solutions, nothing says € must be an integer. The 
particular boundary conditions of the problem being solved will usually limit the 
possible values of £ to some discrete set. Often this is a set of integers. With integer 
values for €, the infinite series solutions for the Oe(x) solutions can be expressed in 
closed form and paired with the P(x) solutions to give two independent solutions, 
as summarized in Table 11.1. The Pe(x) and Oc(x) solutions for £ = 0,1, and 2 are 
shown in Figures 11.19 and 11.20. 
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TABLE 11.1. The P,(x) and Q¢(x) Solutions to 
Laplace’s Equation in Cylindrical Geometry 


£ s = 1 Solution s = 0 Solution 
0 Qo(x) Po(x) 

1 Py Qi) 

2 


Q2(x) P(x) 


Figure 11.20 The First Three Qe(x) Functions 


11.4.4 Solutions with @-Dependence 


Now we return to the solution of Equation 11.161, this time allowing a @-dependence. 
Remember the separated solution to Laplace’s equation in spherical coordinates had 
the form 


P(r, 0,6) = “DFP, (11.204) 


where 


£ 
BO eee (11.205) 


r 


SPHERICAL SOLUTIONS 469 


and 


ering 
FB) fo mp (11.206) 


A substitution of x = cos @ made P(@) = P(cos 6), and P(x) solved the equation 


fe) 1 9 a _ m2 < 2! 
iz la x 2 l-2 +£(E + 1} Poy =0. (11.207) 


From the form of Equation 11.207, it is clear that P(x) depends on both m and £, so 
we will label the solutions of P(x) with two indices: P(x) > Pom(X). 

A brute force Frobenius solution to Equation 11.207 is very messy. It is simplified, 
somewhat, by making the substitution 


Pom(x) = (1 — x29"? Wem(x). (11.208) 
With this, Equation 11.207 becomes 


2 
{0 =x") 2. ~2(m + Dae — [m+ m —£e + vy} Uem(x) = 0. 


(11.209) 
Using the method of Frobenius, we will try to find a solution of the form: 


Uem(x) = So anx"*', (11.210) 
n=0 

When this is substituted into 11.209, equations for the a, coefficients are obtained. 
For n = 0 and n = 1, the two indicial equations are 

ao(s(s — 1) = 0 (11.211) 
and 

ay(s + 1)(s) = 0. (11.212) 
For n = 2, the generating equation for the a,,’s is 


(nt+s—2)\(nt+s+2m~1l)+m+m —2£€4+1) 
(n+ s)\(n+5—1) ‘ 


(11.213) 


Gn — An-2 


There are two solutions for the indicial equations, s = 1 and s = 0. Thus we 
obtain two different solutions, 


Aem(x) = So anx™*! gs = 1 (11.214) 
n=0,2.,... 
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and 
Bom(x) = = b,x" s=O, (11.215) 
n=0,2.... 


where, as usual, we have defined ag # 0, by) # 0, and a; = b,; = 0. The generating 
equation for the a,,’s, for n = 2 becomes 


(n— 1)(n + 2m) + m+ m — £(€ +1) 
.3.7.COrOr.. oe eee 


= 11.216 
Ta ae (n+ In) ( ) 
and the one for the b,,’s becomes 
—2)(n+ 2m—-1)+m+m?-L€4+1 
eh oe Dee ee (11.217) 


n(n — 1) 


Asn — ©, both a, = a,—2 and b, = b,-2. This means that for these solutions to be 
valid on the z-axis (where x = +1), the series solutions must terminate. This is the 
same argument used in the derivation of the Lengendre polynomials. 

Let’s look at the termination of the s = 0 series, the one for Bg,,(x). For this series 
to terminate 


(n — 2)(n + 2m —- 1) + m+ m’ —£(€4+1)=0. (11.218) 
With some rearrangement this condition for termination becomes 
(n+m—2)(n+ m—1)= €(€ + 1). (11.219) 


It should be noted that this simple condition for termination was what originally 
motivated the substitution of Equation 11.208. Had we not made that substitution, 
this condition would have been much more difficult to deal with! At this point, all we 
know about the terms in Equation 11.219 is that, according to Equation 11.150, m 
must be a nonnegative integer, and that n is a positive, even integer. This means that 
the first factor on the LHS of Equation 11.219, (n + m — 2), is an integer and that 
the second factor, (1 + m — 1), is equal to that same integer plus one. Therefore, if 
Equation | 1.219 is to be satisfied, and the series is to terminate, € must be an integer. 
As was the case before, the solutions derived from considering negative values of 
£ are just repetitions of the positive € solutions. Therefore, we need only consider 
£051, 25 sien 

An interesting result of Equation 11.219 is a new condition on m. Pick some 
positive, integer value for €, say € = £,. Termination then occurs when 


(n+m—2)\(n+m— 1) = £,(€, + 1). (11.220) 


If we set m = £,, termination can occur with n = 2. If m > £9 the LHS of 
Equation 11.220 is always greater than the RHS and termination can never occur. 
When m = £5 — 1, there is no way to satisfy Equation 11.220 with an even value of n. 
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If we move m down another notch, and set m = £9 —2, we can satisfy Equation 11.220 
again with a choice of n = 4. If we continue this process, we will find we can satisfy 
the condition in general for every other integer value of m until we reach either m = | 
or m = 0, depending on whether €, was even or odd. That is, the values of m can be 


£, (n = 2) 
£,-2 (n=4) 

m= 40-4 (n=6), (11.221) 
Le 


Remember this requirement is for the s = 0 series. 
If we determine, in a similar way, the termination condition for the s = | series, 
we find the relationship 


(n+m— 1)\(n+m) = £,(€, + 1). (11.222) 


The highest value that m can take on and still satisfy this equation is 2, — 1, with the 
n = 2. If m gets higher than this value, it is impossible to satisfy the relationship. 
If we try m = £, — 2, again we find we cannot satisfy Equation 11.222. But when 
we move down to m = £, — 3, we can satisfy it with n = 4. Much like we derived 
Equation 11.221, we find all the possible m values are given by 


£,-1 (n=2) 
£3 (n = 4) 
we2.to=s = 6) (11.223) 


1orO 


where again, the lowest value of m depends on the starting value of £,. 

When we consider both the s = 1 ands = Oseries solutions collectively, we find an 
important fact. To have at least one valid solution, m must obey either Equation 11.221 
or Equation 11.223. In other words, 0 = m = £p. 

Now that we know the range of m, we need to determine what the terminating 
P2,,(x) solutions actually look like. We will start with the £ = 0 solution. According 
to the previous discussion, with £ = 0 there can only be a terminating series solution 
if m = 0, and it must come from the s = 0 series with n = 2. Therefore, using 
Equation 11.208, 


Poo(x) = Uoo(x) 
= Bo (11.224) 
= bo. 


The particular value of b, is usually chosen as 1. This solution is not terribly exciting, 
since it is just a constant. 
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With £ = 1, there are two possible m values: m = 1 and m = 0. The solution 
P,1(x) uses the s = 0 series with n = 2 so 
Py(x) = 1 x*)'?7U 1) 
= (1 — x2)'B) (x) (11.225) 
= (1 — x?)!/2bp. 
As before, we usually set b, = 1. The P,o(x) solution uses the s = 1 series with 
n = 2, which gives 
Pyo(x) = Uyo(x) 
= Ajo(x) (11.226) 
= aox. 
If we set ap = 1, Pio equals the P,(x) Legendre polynomial we derived earlier. 
In this way the solutions for all the terminating Pen(x) solutions, valid on the 
z-axis, can be developed. The solutions are collectively referred to as the associated 


Legendre polynomials. A generating function for these polynomials, a generalization 
of Rodrigues’s formula, has been developed: 


: 1 2/2 (x? — 178 
These functions obey the orthogonality condition 
1 
‘ ‘ _{ 2 \@+m! 
ie Ax Pen(x)P em (X) (a . :) @— myince” (11.228) 
which in terms of the variable @ is 
Oe a : : 2 (€ +m)! 
| dO sin OP pm (COS 0)Peim(cos 8) = ( aoe 7 Ea mie (11.229) 


Finally, the general solution to Laplace’s equation in spherical coordinates using 
the R(r), F(@), and P(@) functions can be written down. The result is 


wo +e ré etine n 
Dr, 0,6) = a ee Vos Pem(cos 6). (11.230) 
£=0 m=0 


Example 11.6 We will now use these results to find the electric potential inside the 
spherical surface shown in Figure 11.21. Suppose that different electric potentials 
have been applied on each quadrant of the surface, as indicated in the figure. We will 
only show a general method of solution here and leave complete solutions for the 
exercises at the end of this chapter. 
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+V, 0<@<n/2, 0O<o<R 


®(R,9,90)= -V 0<@<n/2, n<b<2n 
+V, W/2<O0<n, N<O<20 
+V. W/2<O0<n, O<O<z 


Figure 11.21 Boundary Conditions on a Spherical Surface 


Start with the general spherical solution in the form 


oo £ e 
P(r, 0, ) = DF | os { Loa) {Pen(c0s 0) (11.231) 


pare sin(m@) 


We have chosen to use trigonometric functions for the @-dependence instead of 
complex exponentials, because the electrostatic potential should be a real quantity. 

Immediately, we can make some useful simplifications. All the r~*~' solutions 
must vanish because the potential must be finite at r = 0. In addition, the cos(m@) 
solutions are also zero because the boundary conditions, and therefore the solution, 
have odd symmetry about @. This reduces Equation 11.231 to 


foe) £ 
Or, 6,6) = s; > Aemr~ sin(md)Pem(cos 6). (11.232) 
£=0 m=0 


Notice that there is a single amplitude constant Ag, which is indexed by both € 
and m. 

Values for this constant are determined by requiring the solution to go to the 
prescribed values at the boundary r = R,: 


00 £ 
B(Ry, 0,6) = S~>S~ AemRS sin(md)Pen(cos 0). (11.233) 


£=0 m=0 


A particular m value can be selected using the orthogonality of the sine functions. A 
particular € value can be isolated using the orthogonality of the associated Legendre 
polynomials. Using these orthogonality conditions allows a solution for Ag»: 


_ (2%+1\ &-m! £7" greet et : 
Agm = (Fae \e | do : d@ sin 6 sin(m@)P¢,,(cos 8)P(Ro, 6, d). 


(11.234) 
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This integral determines the Ag,, which are then inserted into Equation 11.232 for 
the solution inside the spherical surface. 


11.4.5 Spherical Harmonics 


It is customary to combine the e””"? functions with the associated Legendre polyno- 
mials in Equation 11.230 to define a new function of @ and 0, the so-called spherical 
harmonics: 


ay. [(2E+ IE = mM)! img > 
Yon(8,6) = (—1) ay 7 e'® Po,,(cos 8). (11.235) 


The complicated multiplying factor just normalizes the function, so that the orthog- 
onality condition, discussed shortly, is as simple as possible. The (—1)” factor is a 
convention used frequently in quantum mechanics. Notice, because we separated out 
the e’"* factor, m must now be allowed to vary between —£ and +2£. To make this 
work, we define the associated Legendre polynomials with negative values of m to 
simply be proportional to the same polynomial with a positive m: 


(-1)"(€ — m)!. 


Pe -m(X) = Era Pg m(X). (11.236) 


The weird proportionality constant makes Equation 11.227 continue to hold for all 
values of m. 
Using the spherical harmonics, the general solution of Equation 11.230 becomes 


ro) +£ 


£ 
Or, 0,6) => >> { nbs } Y om(9, ). (11.237) 


£=0 m=—-£ 


The spherical harmonics obey an orthogonality condition: 


2a 7 
7 dp | dO sin 0 Y3,(O,D)¥ pim(O,b) = See: Bmum'- (11.238) 
0 0 


Notice, since the spherical harmonics are complex functions, the orthogonality rela- 
tion involves a complex conjugate operation. 
The first few Ye,,’s are: 


Yoo(8,0) = 1/40 


Y,_,(0,6) = /3/82sin 6e 
Y,.(0.0) = V3/41cos 0 


¥\,(0,6) = —V3/87 sin be'* 
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Y,-2(0,6) = 1/45/9627 sin? 6e~7¢ (11.239) 
Y,-1(0,6) = 1/45/247sin @ cos de‘ 


Y9(86,6) = V5/4a € cos? @ — 5) 


¥>,(0,¢) = —1/45/247rsin 6 cos be'* 
Y(0,6) = /45/96r sin’ 6e'?. 


These functions can be visualized by making three-dimensional, polar plots of 
IY em(9, &)I? vs. 6 and db. The plot for Y99(8, @) is shown in Figure 11.22, where the 
magnitude squared of the spherical harmonic is represented by the distance from the 
origin to the surface. Similar surfaces are shown for the € = 1 and € = 2 spherical 
harmonics in Figures 11.23 and 11.24. In these figures, the z-axis is vertical, and the 
surfaces are always symmetric about this axis because |e"*|? = 1. Also, the plots 
for +m are always exactly the same, because |Y¢,,| = |Ye—ml. 

The relevance of these spherical harmonics to the quantum mechanical wave 
functions for the hydrogen atom should be clear to anyone with a chemistry or 
physics background. It is important to realize that the restriction on m, i.e., that 
|m| = 2, is a mathematical result, originating from our requirement that the solutions 
converge for all values of 6. 


f ¢. 


Y,:(8,0) Y (9,0) Yi1(0,6) 
Figure 11.23 Surfaces Representing the Y ,,,(6, ) 
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> 


RK € t 


Y22(0,0) Y21(8,6) 
Y2-2(0,) ¥2-1(0,) Y¥20(8,0) 


Figure 11.24 Surfaces Representing the Y,,,(0, 6) 


EXERCISES FOR CHAPTER 11 


1. Find a series solution for the electric potential inside a two-dimensional square 
box, with sides held at the potentials shown below: 


®=0 


2. Find the two-dimensional solution to Laplace’s equation inside a rectangular box 
of length a and height b, if the boundary conditions for the solution are as shown 
in the figure below: 


3. Find the solution to Laplace’s equation inside a 1 X 1, two-dimensional square 
region, if the solution on the boundary is as described in the figure below: 
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y 
_o=x 
o=0 o=y 
x 
o=0 


4. Consider the L X 3L rectangular box shown below. 


0 +1 0 
L 0 0 
0 +1 0 
_ 3L 


The electrostatic potential on the boundaries of this box is indicated in this figure. 
The end caps of the box are held at @ = 0 zero, while the central section has 
® = 1. What is the potential everywhere inside the box? 


5. Find a series solution for the potential inside the two-dimensional rectangular 
box, with sides held at +V,, as shown in the figure below: 


®=-V 


oO 


L, @=+4V, @=+V 


6. Find ®(x, y), the solution to Laplace’s equation in two dimensions between the 
two parallel planes y = 0 and y = a,. Assume ®(x, 0) = 0, and that (x, a.) is 
periodic with period 2b,. Over one period, this function is given by 
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_f+1 O0<x<b, 
Tee or by <x < 2b,’ 
and is periodic with period 2b, for all x. 


7, Find the two-dimensional solutions to Laplace’s equation inside the two isosceles 
right triangles, with the boundary conditions shown in the figures below: 


8. Consider the following two-dimensional boundary conditions and the solution 
to Laplace’s equation inside this boundary. 


(a) Set up the periodic boundary conditions that will generate a solution to 
Laplace’s equation inside this rectangular region and indicate the region 
of this periodic problem where its solution is equal to the solution of this 
rectangular problem. 


(b) Obtain a solution to Laplace’s equation inside the rectangular region that 
satisfies the above boundary conditions. 


(c) Repeat the above steps for the boundary conditions shown in the following 
figure. 


2L, 
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9. For this problem, you are to develop the Green’s function for a square drum head 
lying in the xy-plane, as shown below. The boundaries at x = 0,L and y = 0,L 
are held at h = 0. 


The displacement of the vibrating drum head h(x, y)e’” is governed by the 
partial differential equation 
“it 
@ 


where d(x, y)e’” is a driving function. The Green’s function for this problem 
therefore satisfies 


V7h(x, y) + A(x, y) = d(x, y), 


2 
G + =) a(rlé vg) = 80x ~ &)8Ly ~ &). 


(a) As the first step in finding the Green’s function, turn this problem into one 
that is infinite in the x- and y-directions. Indicate the locations and nature of 
the driving 5-functions for a periodic problem whose solution is identical to 
the drum head problem in the region0<x<LandO<y<L. 


(b) Find a series solution for g(xlé,, yl). 


10. The potential between two concentric, electrically conducting cylindrical sur- 
faces is a simple electrostatic problem. If the inside cylinder of radius r, is held 
at a potential V;, and the outer cylinder at r2 is held at V2, the potential between 
the surfaces is 


V, - V2 V; Inr. — V2Inr; 
ie 
) In (72/71) 


In (r 1 / 2 
Verify this expression and show how it is given by the general cylindrical solution 
presented in this chapter. 


Dr) = 


11. Consider the two-dimensional polar form of Laplace’s equation 


fe 10 1 2 
+ 


— + -— + ——| ulp, 6) = 0 
6p? pdp_— p? a? 2 


and the separated solutions of the form u(p, 8) = R(p)H(@). 
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(a) What are solutions for H(@)? 

(b) Determine the two independent solutions for R(p). 

(c) Using the results from parts (a) and (b), determine the two-dimensional 
solution to Laplace’s equation inside a circle of radius rj, if on the circle, 
the solution goes to u(r,, 0) = V, sin 0. What is the potential outside this 
circle, for p > ro? What assumptions have to be made to get the answer for 
p >To? 


12. Find the solution to Laplace’s equation inside a cylinder of radius r, and length 
L, with the boundary conditions shown in the figure below: 


<> 


Now find the solution inside the same cylinder with the boundary conditions 
defined as shown in the figure below: 


<> 


13. Find the solution to Laplace’s equation inside a cylinder of radius r, and length 
L, with the following boundary conditions: The sides and bottom are held at 
® = 0 and the top is held at ® = V,(1 — p/r,), where p = 0 is the axis of the 
cylinder. Use a reference such as Abramowitz and Stegun to at least make a stab 
at the integrals. 
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14. Find the solution to Laplace’s equation inside the hollow cylindrical region of 
length L between 7; and 72, if the solution is to go to zero on the top and bottom 
and to +V, and —V, on the outer and inner surfaces, as shown below: 


tr, 


® (r,) =+V, 


® (r,) =-V, 


To solve this problem you must construct a new eigenfunction Mo(kp), which 
is a linear combination of Jg(kp) and Ko(kp), such that Mo(kr;) = +1 and 
Mo(kr2) =-~1. 


15. Find the solution to Laplace’s equation for the same cylindrical region as de- 
scribed in the previous problem, but with the different boundary conditions shown 
below: 


= 


® (1,) =0 


® (r,) =0 
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16. For small, static displacements, the deflection of a drum head satisfies the fol- 
lowing two-dimensional differential equation in polar coordinates 


TV7x(p, 0) = F(p, 8), 


where T is the tension in the drum membrane, z is the displacement, and F is the 
force per unit area applied to the membrane. Let the drum head have a radius r,, 
as shown in the figure below. 


Take the applied force per unit area to be independent of 0, so that the differential 
equation becomes 
1d F dz(p) 
pdp dp 


= F(p). 


(a) Determine the Green’s function g(p|é) that allows the displacement to be 
written as 


a ) "dé fOg(olé). 


(b) Plot g(plé) vs p. 
(c) If F(p) = 1 — p*/r?2, set up the Green’s function integration for z(p). 
17. This problem considers the two-dimensional solution to Laplace’s equation 


in cylindrical coordinates from a Green’s function point of view. For no z- 
dependence, find the Green’s function g(p, 6|0,) that allows the solution 


21 
(p, 6) = i 40, f(0,)e(p, 6180). 
0 


The function f(6) = (r,, 0) defines the boundary condition at p = rj. In- 
side this boundary ®(p, 0) satisfies a two-dimensional Laplace’s equation in 
cylindrical coordinates 


IP’ D(p,6) | 1dD(p,6) | 1 dH(p, 6) 
So he SE pe ee eg) 
dp? p op p> ae? 
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18. 


19. 


20. 


21. 


The standard way to solve this problem sets ®(p, 6) up as a series which must 
be shown to converge. 


Using similar steps to those in the previous problem, find the Green’s function 
for the two-dimensional homogeneous equation 


V7u(p, 0) + k?u(p, 0) = 0 


inside a circle of radius p = Ro, with the boundary condition that u(R,, 0) = 
So( 9). 


The general form for the solution to Laplace’s equation in cylindrical coordinates 
that is not periodic in z was expressed in this chapter using a shorthand notation 
that did not include the amplitude constants: 


J, (kp) { sin (v6) { sinh (kz) 


D(p, 6,z) = we cos(v@) | cosh(kz) ° 


If this expression is expanded out, eight amplitude constants need to be intro- 
duced. If a solution is sought for p < r,, the Y,(kp) solutions can be eliminated 
because they diverge as p — 0. Therefore, the number of amplitude constants is 
reduced to four. If, however we exclude the origin from the domain of the solu- 
tion, i.e., if we limit the problem to a range a < p < b, both the J, (kp) and Y,(kp) 
functions are necessary and all eight amplitude constants must be considered. 
The amplitude constants are not the only complication for this a < p < b 
situation, because the spatial factor k is also determined by boundary conditions. 

Consider a solution valid for a < p < b that is zero at p = a and p = b. 
To solve this problem, construct a new eigenfunction from the linear sum of the 
Bessel functions. Call this new eigenfunction R,(kp) where 


Ry (kp) = A Jy(kp) + B Y,(kp). 


Discuss how you would determine the set of values for k and the linear scaling 
factors A and B, so that this new eigenfunction is normalized and satisfies the 
zero boundary conditions at p = a and p = b. 


The potential between two concentric, electrically conducting spherical surfaces 
is a simple electrostatic problem. If the inside sphere of radius r, is held at a 
potential V,, and the outer sphere at rz is held at V2, the potential between the 
surfaces 1s 


_, Vilr— 723 " nr(Vi — V2) 1 
Tr) — 1 m7, r 


Or) 
Verify this expression and show how it is given by the general spherical solution 
presented in this chapter. 


Find the potential between two concentric spherical surfaces of radius R, and 
2R,. The voltage on the upper hemisphere of the outer surface is held at + V,, 


22. 


23. 
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and the voltage on the lower hemisphere of the outer surface is held at —V,. The 
voltage on the upper hemisphere of the inner surface is held at —V,, and the 
voltage on the lower hemisphere of the inner surface is held at +.V,. 


A hemispherical shell of radius R,, held at ® = +V, is positioned on top of 
an infinite conducting ground plane held at ® = 0. Use symmetry, in this case 
sometimes referred to as the method of images, to find the solution to Laplace’s 
equation both inside and outside the hemisphere that satisfies these boundary 
conditions. 


Find a series solution for the potential inside a spherical shell whose Hemispheres 
are held at +2V, and — V, as shown below. 


Nay, 


Can you think of an easier way to solve this problem, using the result of the 
previous exercise? 


- Helmholtz’s equation is written as 


V2-¥OH +h V@H =0 


where k? is a positive constant. Show that in spherical coordinates this equation 
separates and the differential equation the radially dependent component satisfies 
becomes 


d*R,(r) dR,(r) 
2 n n 

+ 2. 
: dr2 ‘ dr 


+ [K?r? — n(n + 1)] Ra(r) = 0 


where n is an integer. 

(a) Compare this to Bessel’s equation. 

(b) Show that R,,(r) for different values of n can be combined with the operator 
and appropriate boundary conditions to satisfy the Hermitian condition. 
Identify those boundary conditions and the space over which the integration 
must occur to establish the orthogonality condition for these functions. 

(c) Show that the substitution 


Zn(kr) 


Vir 


gives a form of Bessel’s equation, with Z,,(kr) being the Bessel function of 
order n + 1/2. The functions R,(kr) are called Spherical Bessel Functions. 


R,(kr) = 
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25. The quantum mechanical operators L, and L_ are given by 
; oe! ?., 7] 
Ly =L, + ily = e* E + jot | 


and 


io [| O r) 
L_=L,-iLy = -e"* j . 
x — iby e E jcot 0, | 


The spherical harmonics Y¢,,(0, @) are not eigenfunctions of these operators. 
However, interesting things happen when these operators work on the Y¢,,(0, ). 
Show that 


Ls Yem(0, 6) = Jl — ml + m+ V)¥em+1(0, 0) 


and 


L_Yem(0, 6) = JL + ml — m+ 1)¥em—1(8, d). 


Because of this property, L; and L_ are called raising and lowering operators. 


26. Find the potential between 60° < @ < 120° of two infinite 60° cones, one held 
at +2V, and the other at + V,, as shown below: 


27. Show that the eigenvalues of an Hermitian matrix are pure real quantities and 
that the eigenvectors are orthogonal. 
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28. Because of their orthogonality and completeness properties the Legendre poly- 
nomials can be used to make series expansions of functions 


f(x) = S> CePe(x). 


£=0,1.... 


Let f(x) be the triangular function over the range —1 < x < 1 shown below: 


(a) What are the first three coefficients of the Legendre polynomial expansion 
of this function? 

(b) What does the Legendre expansion for this function look like for |x| > 1? 

(c) Consider a new function which is a similar triangular function, except now it 
extends from x = —2 tox = 2. Construct a Legendre polynomial expansion 
for this function and identify its first three coefficients. 

(d) If now f(x) = 1- x? over the interval —1 < x < +1, what are the first 
three coefficients of its Legendre polynomial expansion? 


29. The Legendre polynomials are determined to within arbitrary constants by the 
homogeneous differential equations they satisfy: 
Po(x) = Ao 
P 1 (x) = Aix 
P2(x) = Ag(1 — 327) 


Determine the constants Ag, A;, and Ap for the first three Legendre polynomials, 
so that the functions obey the orthogonality condition 


1 
7 < 2 
dx Po(x)Pe(x) = —— dee. 
J axPecoreowo eee 
30. The following differential equation is set up in eigenvalue/eigenfunction form, 


E cos 6 d 
+ 


762 wo a P,(0) = —AP, (6), 
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31. 


32. 


where 6 ranges from zero to zr, and P)(@) is an eigenfunction of the differential 
operator with the eigenvalue A. This operator is not Hermitian. 


(a) Place this differential equation in Sturm-Liouville form and show that the 
operator that results is Hermitian. 


(b) Using this new Hermitian operator show that if m # n 
Tw 
i dO g(9)Pm(A)P.(8) = 0 
0 


and determine the weighting function g(6). 


Consider the differential equation 


d d 
( + oF ) n(x) = —An bn(X), 
x 


dx? 
where a is a positive constant. This equation is in eigenfunction, eigenvalue 


form. 


(a) If A, is assumed to be positive and real, show that the two solutions for the 
eigenfunctions become 


e @/? sin (xVdn — 07/4) 
e @/2 cog (xVan — 07/4) 


and that this form is valid for any value of A, # a?/4. 


(b) Now require that the eigenfunctions of part (a) satisfy the following boundary 
conditions: @,(0) = Oand ¢,(L) = 0. These boundary conditions determine 
the acceptable values for the A, as well as the acceptable ¢,(x). Use the 
n-index to identify these acceptable values for the A,, and make a plot of the 
first three acceptable eigenfunctions. 


n(x) > 


(c) The differential equation given above is not in Sturm-Liouville form. Find 
the appropriate multiplying function and place it in S-L form. 


(d) Determine the orthogonality conditions for the eigenfunctions of part (a) 
that satisfy the boundary conditions of part (b). Be sure to identify the region 
over which the functions are orthogonal and explain how you arrived at your 
answer. 


The Gram-Schmidt orthogonalization procedure generates a set of orthonormal 
functions (or vectors) from a set of independent functions (or vectors). For this 
problem start with the set of independent functions y,(x) = x”. A number of 
different orthonormal sets can be constructed from these y,(x), depending on 
the weighting function used. Let ¢,(x) be the set of functions generated from 
different sums of the y,(x), and let the orthonormalization be given by the 
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following integration 
| dx Gn(x)dm(x)e * = 5nm> 
0 


where a weighting function w(x) = e~* has been used. 
(a) The first function (x) is determined by setting it equal to a constant times 
yo(x) and then normalizing: 


do(x) = 400 Yo(x). 


What is do(x)? 
(b) The second function ¢;(x) is set up as a linear sum of y;(x) and p(x): 


1 (x) = ary yi (x) + Dio ho(X). 


Find the constants a); and bj9 that normalize @,(x) and simultaneously make 
it orthogonal to d(x). 

(c) Continue this process and construct ¢2(x). Set up ¢@2(x) as a linear sum of 
ya(x), bo(x), and d(x): 


b2(x) = az yo(x) + ba ho(x) + br by (x). 


Find the constants a2, bo9, and b2, that normalize ¢2(x) and make it orthog- 
onal to }o(x) and ,(x). 
These @¢,,(x) functions are known as Leguerre polynomials. It is interesting to note 
the Legendre polynomials can be generated by the Gram-Schmidt process using 
the same set of independent functions y,(x) = x” and the orthonormalization 
given by 


5. 


1 
2 
a dx n(x) bm(X) ve In+1 1 


33. The elliptical coordinates (u, v, Zz) are related to a set of Cartesian coordinates by 
the equations 


x = p,coshucos v 
y = p, sinhusin v 
Z=2, 


where p, is a positive constant needed to make the above equations dimensionally 
correct. Laplace’s equation for V(u, v, z) becomes 


Vu, v,2) 1 ( Fe ae 


f+ 2 \ wu,v,9 = 0. 
az 2p2[cosh (24) — cos (2v)] \ au =) Ge 
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35. 


Assume V(u, v,z) can be separated, ie., that V(u,v,z) = f(u)g(v)h(z), and 
determine the three ordinary differential equations that f(u), g(v), and h(z) 
satisfy. Define the necessary separation constants. 


. Consider a two-dimensional Cartesian coordinate system and a two-dimensional 


uv-system with the coordinates related by 
X= —ukU 
y = (1/2)? = v’). 
In general, Laplace’s equation in two dimensions can be written as 


1 0h @ oh, a 
V-v(q1, \- late ene 
au? hyh2 [@qi hy 0g, ~—-9q2 hz 0G2 


(+) 
: 0g: qi) ~ 


(a) In the xy-plane, sketch lines of constant u and constant v. 

(b) Express Laplace’s equation using the uv-coordinates. 

(c) Use the method of separation of variables to separate Laplace’s equation in 
the uv-system and obtain the general solution for V(u, v). 


V(41,92) =, 


with 


Let n(y, t) represent the number density (the number of particles per unit volume) 
of neutrons in a slab of nuclear reactor material. Take the slab to be infinite in 
the x and z-directions, so that there is no variation in the number density with 
x or z. The motion of the neutrons is described by a diffusion coefficient D. 
The neutrons can also collide with atoms in the slab, creating unstable isotopes. 
When an unstable isotope decays, more neutrons are released than were needed 
to make the atom’s nucleus unstable. This results in a source of neutrons that is 
accounted for by a collision frequency v. Both D and v are positive, real numbers. 
The diffusion and source effects can be combined to give a partial differential 
equation that describes the evolution of n(y, f): 


a*nly, t) anty,t) 
D — + vanly,t) = . 
ay at 
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At y = 0 and y = L, the neutrons can easily leave the slab, so the number 

density is always zero at these boundaries. 

(a) Using the method of separation of variables, obtain a series solution for 
n(y, t) that obeys the above boundary conditions and is valid for an arbitrary 
initial condition, i.e., n(y,0) = no(y). 

(b) Using your answer to part (a), evaluate the series coefficients for the initial 
condition np(y) = N,8(y — L/2). 

(c) The solution for an arbitrary initial condition can be formulated as a Green’s 
function integral. What is this Green’s function, g(y|é, t)? Express the solu- 
tion n(y, t) in terms of a Green’s function integral and show that n(y, 2) goes 
to your answer to part (b) if n,(y) + N,6(y — L/2). 

(d) Using your solution to part (a), determine the thickness L of the slab, which 
results in a nontrivial, steady-state solution for n(y, t) as t > ©. 

(e) What are the consequences of having a slab thickness greater than the value 
you calculated in part (d)? 


12 


INTEGRAL EQUATIONS 


Differential equations have unknown functions or dependent variables inside differ- 
ential operators. In contrast, integral equations have unknown functions or dependent 
variables inside integral operations. For example, 


d2 
Gai 9) = $@) (12.1) 


is a typical differential equation, with one dependent variable @. This differential 
equation can be easily converted to an integral equation using a Green’s function 
argument. The Green’s function solution to Equation 12.1 takes the form 


(x) = | dé o(é)e(al€), (12.2) 


—o 


where g(x/|é) satisfies 


d2 
ysl) = (x — &). (12.3) 


Equation 12.2 is an integral equation because the dependent variable @ appears inside 
the integral operation, and there are no differential operations. 


In this chapter, we discuss four general classes of integral equations, the connec- 
tion between differential and integral equations and several methods of solution. In 
Appendix G, a brief summary of the calculus of variations is given. This is a technique 
that uses the structure of an integral equation and minimizes the value of a definite 
integral with respect to an initially unknown function embedded inside the integral 
itself, 
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12.1 CLASSIFICATION OF LINEAR INTEGRAL EQUATIONS 


The integral equations discussed in this chapter are all linear in the dependent variable. 
They fall into four categories: Fredholm equations of the first and second kind, and 
Volterra equations of the first and second kind. These classifications are important 
because the approaches to solving these equations are different in each case. 


12.1.1 Fredholm Equations of the First Kind 


In a Fredholm equation of the first kind, the dependent variable ¢ exists only inside 
the integral operation, and the limits of integration are constants. For example, 


b 
f(x) = i dx! b(x')k(x, x') (12.4) 


is a Fredholm equation of the first kind. Both f(x) and k(x, x’) are known functions 
of the independent variable x. The function k(x, x’) is referred to as a “kernel.” 


12.1.2 Fredholm Equations of the Second Kind 


Fredholm equations of the second kind differ from Fredholm equations of the first 
kind in that the dependent variable appears both inside and outside the integral 
operation. For example, 


b 
d(x) = f(x) + af dx! b(x')k(x, x’) (12.5) 


is a Fredholm equation of the second kind. Again, @(x) is the unknown function, but 
both f(x) and the kernel k(x, x’) are known. The limits of the integral, a and b, and A 
are all constants. 


12.1.3. Volterra Equations of the First Kind 


Volterra equations of the first kind are similar to Fredholm equations of the first kind, 
except the limits of integration are not constants. The unknown function still appears 
only inside the integral. Therefore, 


f= A dx' b(x')k(x, x’) (12.6) 


is an example of a Volterra equation of the first kind, because the upper limit of the 
integral is the variable x. 


12.1.4 Volterra Equations of the Second Kind 


Finally, a Volterra equation of the second kind has the unknown function both inside 
and outside the integral operation, and the limits of the integral are not constants. The 
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2quation 
xX 
(x) = flx)+ / dx! b(x'yk(x, x" (12.7) 
a 


is an example of a Volterra equation of the second kind. 


12.2. THE CONNECTION BETWEEN DIFFERENTIAL 
AND INTEGRAL EQUATIONS 


Differential equations can be converted to integral equations. In this process, the 
boundary conditions associated with the differential equation are automatically in- 
corporated into the integral equation. Thus, unlike differential equations, integral 
equations will have unique solutions without specifying additional conditions. 

Let’s see how a ordinary second-order, linear differential equation can be converted 
to an integral equation. We start with the most general form 


2 
ae + AG oe) 2) + Booy(x) = s(2), (12.8) 


where the functions A(x), B(x) and s(x) are presumed to be known. Because this is 
a second-order equation, two boundary conditions are required to make the solution 
y(x) unique. For this case, we will specify y(x) and its derivative at the point x = a: 


y(a) = Yo (12.9) 
and 
ge) = y,. (12.10) 
dx 


To convert Equation 12.8 into integral form, all the differential operations must 
be removed by successive integrations. The first such integration is accomplished by 
changing the variable in Equation 12.8 from x to x’, and then operating on both sides 


with 
| dx’. (12.11) 
a 
After this integration, Equation 12.8 becomes 
I I 
noe 
dx! dx! a 


+ a dx' B(x')y(x') = ced a Gly. 


a a 
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Applying the boundary condition of Equation 12.10 and integrating the first integral 
on the LHS by parts gives 


dy(x) 
dx 


a“ ‘) 


- y+ Acdy(ah |) — [a Sve 


+ [ dx' B(x')y(x') = ie dx! s(x’). (12.13) 


a 


Using the other boundary condition given in Equation 12.9, this equation can be 
rewritten as 


dy(x) 
dx 


— yo + A(x)y(x) — A(a)yo 


+ i “a! |ous- 4 GAG 1, (x') = | ay (12.14) 


One more integration is necessary to remove the derivative in the first term. To do 
this, the variable x is changed to x” and the equation is operated on by 


| dx". (12.15) 


The dummy integration variable is labeled x" to avoid confusion with x’, the dummy 
variable of the first integration. This integration, and additional application of the 
boundary conditions, transforms Equation 12.13 to 


{ase & i] dx" AQx")y(x") — A(a)yo(x — a) 


a 


+ fa [ dx! [ae ty |, (x') (12.16) 
= pw" a dx! s(x’). 


Using the double integral identity 
| ax" [ dx! f(x') = / dx! (x — x f(x’), (12.17) 
a a a 
which is proven in Appendix C, this equation becomes 


y(X) — Yo — yy(x — a) + | . dx"A(x")y(x") — A(a)yo(x — a) 


+ is dx! (x — x’) Jaw 4 aa aie = [ dx'(x — x')s(x’). (12.18) 
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This equation can be rearranged as 


y(X) = Yo + yaa — a) + Ala)yo(x — a) + | : dx'(x — x')s(x') 


a 


+ fa{o-0]8@ ac) acd bre (12.19) 


dx! 


which is in the form of a Volterra equation of the second kind, like Equation 12.7. 
The unknown function is 


o(x) = yx), (12.20) 


the known function is 


f= yo + yh(x -— a) + A(a)yo(x — a) + i dx'(x — x')s(x'), (12.21) 


a 


and the kernel is 


dA(x') 
dx! 


k(x, x') = (x - | - ae — A(x’). (12.22) 


The solution to Equation 12.19 is identical to the solution of Equation 12.8, 
when the boundary conditions given by Equations 12.9 and 12.10 are applied. To 
work with the differential equation, the boundary conditions must be specified in 
addition to the differential equation itself. With this integral equation, however, 
no additional information is necessary to arrive at the same unique solution. The 
boundary conditions are built in. 


Example 12.1 As a specific example of the conversion of a differential equation to 
an integral equation, consider the differential equation for a harmonic oscillator, 


d*y(x) 


somes k2y(x) = 0, (12.23) 


with the boundary conditions 

y(0) = y(1) = 0. (12.24) 
Notice we cannot use the general result derived in the last section, because the form of 
the boundary conditions is different in this case. Here we have specified the unknown 


function’s value at two end points. Before, we specified the function and its derivative 
at a single point. 
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As we did before, we will change x — x’ and then apply the operator 


x 
i dx! (12.25) 
0 
to both sides of Equation 12.23 to give 
d Uh d tf x 
x a> oy al dx' y(x') = 0. (12.26) 
x dx = 0 


The (dy/dx)|) term on the LHS is a constant, but it is not one of the boundary 
conditions. We label it as the constant Yoo but remember that it is not yet known. We 
integrate again, by transforming x — x", and applying the operator 
x 
i dx" (12.27) 
0 
to both sides to obtain 


y(x) — y(0) — yax +: 2 i) dx” i dx'y(x') = 0. (12.28) 
o 0 


The quantity y(0) is one of the boundary conditions. Setting y(0) = 0 and using the 
double integral identity of Equation 12.17 gives 


y(x) ~ yox + 2 i dx' (x — x')y(x') = 0. (12.29) 
0 


We are not finished yet, because the constant yj is not known. Also, we have not 
made use of the second boundary condition at x = 1. These two things are taken care 
of at once by evaluating Equation 12.29 at x = 1: 


1 
yd)-— ye + a f dx' (1 — x')y@x') = 0. (12.30) 
0 
Because y(1) = 0, this equation allows y; to be evaluated as 
1 
y= i dx' (1 — x’)y(x’). (12.31) 
0 
Equation 12.29 becomes 
1 x 
y(x) = ax | dx'(1 — x')y(x') - ey dx! (x ~ x')y(x’). (12.32) 
0 0 


This is clearly an integral equation. All the differential operators have been removed, 
and both the boundary conditions have been incorporated. The unknown function 
appears both inside and outside the integral operations, but one integral has constant 
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limits and the other has the independent variable as an upper limit. This appears to 
be a mixture of a Fredholm equation of the second kind and a Volterra equation of 
the second kind. A bit more work clears up this confusion. 

The integration from x’ = 0 to x’ = 1 can be broken up into two parts. Because 
the independent variable x ranges from 0 to 1, Equation 12.32 can be rewritten as 


xX I 
y(x) = a | dx'x(1 — x')y(x') + ea dx' x(1 — x')y(x’) 
0 x 
—K i) dx' (x — x')y(x'). (12.33) 
0 
The two integrals from x’ = 0 to x’ = x can be combined: 


Xx 1 
ya) =k | dx'x'(1 — x)y(x') + k? | dx'x(1 — x')y(x’). (12.34) 
0 x 


Equation 12.34 is now in the form of a Fredholm equation of the second kind: 


1 
y(x) = kK? | dx'k(x, x')y(x’), (12.35) 
0 
with 
n_fxd-x) Os8%x's<x 
k(x, x') = e yh) ee gle ys (12.36) 


Notice that the independent variable x is a constant inside the integral operation, 
while the x’ is the variable of integration. 

The kernel for this problem is plotted in Figure 12.1. This kernel should look 
familiar to you. The differential equation for the string problem of the last chapter 
was solved using a Green’s function, which was essentially identical to the kernel 
described by Equation 12.36. The close link between this integral equation and the 
Green’s function solution isn’t too surprising when you consider Equation 12.23 from 
a Green’s function point of view. Rewrite Equation 12.23 as 


2 
¢ = = —ky(x). (12.37) 


K(x,x’) 


x 1 
Figure 12.1 Kernel for the Harmonic Problem 
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If the RHS of this equation is looked at as the drive, then the Green’s function solution 
for y(x) can be written as 


I 
y(x) = —ke [ dé g(xlé)y(é), (12.38) 


where g(x) is the Green’s function associated with the d*y(x)/dx? differential 
operator. That is, 2(x|é) is the solution of 


d?9(x]é) 


D2 7 8 © (12.39) 


with the same homogeneous boundary conditions as y(x) 
g(Olé) = g(1lé) = 0. (12.40) 


This Green’s function solution, Equation 12.38, is in the form of an integral equation 
for y(x). It is identical to the integral equation we obtained by converting the original 
differential equation, Equation 12.37, to an integral equation with 


g(xix’) = —k(x, x’). (12.41) 


12.3 METHODS OF SOLUTION 


In this section, several methods of solution for integral equations are presented. The 
different forms of integral equations generally require different solution techniques. 
For example, one takes a different approach for Volterra equations of the first kind 
than for Fredholm equations of the second kind. 

In these discussions, it is useful to define two classifications of kernels, which 
we dub “translationally invariant” and “causal” kernels. A translationally invariant 
kernel has the form 


k(x, x!) = k(x — x!). (12.42) 


The motivation for this terminology is fairly evident, when you consider the con- 
nection with the Green’s function kernels we discussed in the previous section. For 
a translationally invariant kernel, if the drive is displaced by a certain amount, the 
response will not change its character, but will just be displaced by the same amount. 
A causal kernel obeys the condition 


k(x, x’) = 0 x<x', (12.43) 


which again using the Green’s function analogy, means that a response does not occur 
before the drive. Several of our solution techniques require kernels which obey one 
or both of these conditions. 
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12.3.1 Fourier Transform Solutions 


Recall, the standard Fourier transform operation is defined by the pair of expressions 


1 se 
F(@) = —= dte™'™™ F(t (12.44) 
F(w) a des e f(t) 
f= = - _ doe Fo). (12.45) 


A Fourier transform approach works well for finding solutions to Fredholm equations 
of the first kind, which have both translationally invariant kernels and infinite limits 
of integration: 


f@)= ¢ dr k(t — r)y(7). (12.46) 


In this expression, f(t) and k(t — 7) are known functions, and y(t) is unknown. 
Assume that the Fourier transform of each of these functions exists: 


f@ — Flo) (12.47) 
k(t) + K(@) (12.48) 
yt) - Y(o). (12.49) 


Notice, because the kernel is translationally invariant, we can describe its argument 
using only a single variable. 

Because Equation 12.46 is a convolution of k(t) and y(t), its Fourier transform is 
easy to evaluate: 


F(@) = V 27 K(@)Y(@). (12.50) 
Solving for Y(w) gives 


Y(o) = —=>——_-. (12.51) 


The solution of y(t) can now be obtained with a Fourier inversion: 


Pa ee iot £() 
y(t) = on [dee Ko)" (12.52) 


12.3.2 Laplace Transform Solutions 


The Laplace transform approach works for solving Volterra equations of the first kind 
with causal, translationally invariant kernels and integration limits that range from 
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7=Otot=t: 
t 
f@O= i) dt k(t — t)y(7). (12.53) 
0 


Again, f(t) and the kernel are known functions, while y(t) is the unknown. We assume 
J (2), y(t), and k(t) are all zero for t < 0 and possess the Laplace transforms 


f(t) FS) (12.54) 
k(t) + K(s) (12.55) 
y(t) > Y(). (12.56) 


Recall that the standard Laplace operation is defined by the equation pair 
F(s) = j dte~* f(t) (12.57) 
0 


f= 54 7 dse"F(s), (12.58) 


with the Laplace contour J to the right of all the poles of F(s). 

As it stands now, the integral in Equation 12.53 is not a convolution integral 
because the upper limit of the integration is ¢ rather than infinity. This can easily 
be corrected, since we assumed a causal kernel. Because k(t — 7) = 0 fort < 7, 
we can extend the upper limit of the integral to +o without changing the value of 
the integral. Because y(r7) is zero for tT < 0 the lower limit can be extended to —« 
without changing the value: 


00 


f@= [ dt k(t — t)y(7) = i dt k(t — t)y(7). (12.59) 
0 


ao 


The integral equation is now in the form of a standard convolution and applying 
the Laplace transform operation to both sides gives 


F(s) = K()Y(). (12.60) 
Solving for Y (s) gives 
F(s) 
Y(s) = =——. 12.61 
Y(s) K@ ( ) 
The solution for y(t) is obtained with a Laplace inversion: 
1 Es) 
= ae 12.62 
yi) = 55 fase (12.62) 


In this expression, the Laplace contour must be to the right of all the poles of the 
quantity [F(s)/K(s)]. 
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12.3.3 Series Solutions 


Fredholm equations of the second kind can be solved by developing what is known as 
a Neumann series solution. In quantum mechanics, this process is called perturbation 
theory. The general form of a Fredholm equation of the second kind is 


b 
y(x) = f(x) +a J dx! k(x, x’)y(x’), (12.63) 
where f(x) and k(x, x’) are known functions, A is a constant, and y(x) is the unknown. 

The constant A is assumed to be “small” in some sense. To start, we will assume 
that ’ is small enough that 


Lf(x)| > (12.64) 


b 
af dx! k(x, x')y(x') 


foralla < x < b. This really does not pin down the magnitude of A very well, because 
the integral on the RHS of Equation 12.64 cannot be evaluated without knowing y(x) 
in advance. Notice how f(x) cannot be zero anywhere. If it were, Equation 12.64 
could never be satisfied, regardless of the magnitude of A. Colloquially, we describe 
this requirement by saying the method requires a nonzero “seed” function. 

If Equation 12.64 is satisfied, an iterative approach can be employed to obtain a 
solution for y(x) to arbitrary orders of A. As a first approximation, the solution can 
be given by 


yo(x) = f(x). (12.65) 


We call this the zero”"-order solution because A does not appear in this expression. An 
improvement on this approximate solution is made if we substitute the zero“"-order 
solution into the integral of Equation 12.63 and then solve for a new value of y(x): 


b 
yi(x)= f(x) + A if dx'k(x, x')yo(x') 
a, (12.66) 
=f(ix)+nr if dx'k(x, x') f(x’). 
We call y;(x) the first-order solution, because it has a term with A raised to the first 


power. Now you probably see what we are doing. To get the second-order solution, 
we substitute y;(x) back into Equation 12.63 and solve for a new y(x): 


b 
w= FO) +A , dx'k(x, x')y1 (2!) 
b 
= f(x)t+ af dx' k(x, x') f(x’) (12.67) 


b b 
+)? [ dx'k(x, x’) f dx" k(x', x") f(x"). 
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And of course, you guessed it, this is called a second-order solution because the 
highest power of A that appears is A”. This process can be carried to any order. The 
n'*order solution is 


b 
Yn(x) = f(x) +A / dx'k(x, x')yn—1(x') 
b 
= f(x)+ af dx'k(x, x") f(x’) (12.68) 
b b 
+ 0 | as’ [ dx" k(x, x k(x! x") f(x") 


b b b 
+ » | as! [ ax" [ dx ex, x! k(x! V(x", 20!" fl”) 
a a a 


b b 
+A" i dx'--- / Ax" k(x, x!) = ROD x) f(x”), 
a 


a 


In this expression x”’ is used to indicate an x with n primes. 
Now our hope is that 


y(x) = lim yn(x). (12.69) 


That is, if we keep performing the iterative process of substituting our approximate 
result for y(x) back into the equation, we will eventually converge on the correct 
solution. The standard convergence tests should be applied to check that the resulting 
infinite sum converges. Obviously, A must be small enough for this to occur. 


Example 12.2 Most engineering and physics problems formulate naturally as dif- 
ferential equations as opposed to integral equations. That is why more emphasis, in 
this book and in others, is placed on solutions to differential equations rather than 
integral equations. There are some problems, however, that do formulate naturally as 
integral equations. The quantum mechanical scattering problem is one. 

Consider an electron incident on a scattering center, such as the hydrogen nucleus 
sketched in Figure 12.2. The scattering center is modeled by an electrostatic potential 
V(®) that is a scalar function of position. The electron is modeled by a wave function 
(Fr), and its interaction with the potential is described by the Schrédinger wave 
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Figure 12.2. The Quantum Mechanical Scattering Problem 
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equation: 


2m 


h2V2 
|- + vo] wF) = Ew(r). (12.70) 


Here E is the energy of the electron and m is its mass. If we define the constant 
k? = 2mE/h?, Equation 12.70 can be rewritten as 


2m 
(V? + k?) wR) = Fz VO. (12.71) 


This last equation is a linear differential equation with y(F) as the dependent variable. 
A Green’s function approach to solving this equation gives the result 


wr) = elf F + a / a°F’ g(Flr’) VF) WwF’). (12.72) 


The first term on the RHS is the solution for (Fr) when V(fF) is zero, the homogeneous 
version of Equation 12.71. The second term is an integral over all space involving 
the Green’s function g(rlF’). The Green’s function itself is a solution to 


(V? +k’) e(rir’) = 8°r — F’). (12.73) 


It will be assumed that the Green’s function vanishes at infinity. Applying this bound- 
ary condition gives 
; eikir-¥'l 
rir’) = ———_... 12.74 
g(Flr’) aalF = FI (12.74) 
If w(r) were not sitting inside the Green’s function integral of Equation 12.72, we 
would already have our desired solution. But since it does appear both inside and 
outside of the integral, we resort to the Neumann series method described earlier. The 
potential V(r) is assumed to be a small perturbation to the problem. The zero-order 
solution is just the wave function for a free electron 


Wor) = eT, (12.75) 


We obtain the first-order solution by forcing this solution back into Equation 12.72: 
= ike _, 2m 3elorelel\yiepikF’ 
Wy (F) = e+ RD a’r'g(rlr)Vir ye". (12.76) 


This solution is referred to as the first Born approximation. Higher-order approxima- 
tions follow the Neumann series solution method. 

You may have noticed something a little odd about this example. In the derivation 
of the Neumann series, the constant A needed to be small enough for the series 
to converge. In this example, however, the constant A is nowhere to be seen. The 
convergence of this solution is controlled by the magnitude of V(F). 
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12.3.4 Separable Kernel Solutions 


Fredholm equations of the second kind can be solved by the method of separable 
kernels, if the kernel can be made equal to the sum of several products of two 
functions. In mathematical terms, the kernel is separable when 


N 
k(x,x') = S> Mj(x)N((2’). (12.77) 
j=l 


Notice M(x) and N;(x’) are functions of only one variable. This separation can be 
accomplished for a surprisingly large number of kernels, even, as we shall see, a 
translationally invariant one. 

If the kernel separates, we can write a Fredholm equation of the second kind as 


b N 
yx) = fay ta i) dx! S > Mj(x)Nj(x'Vy(x'). (12.78) 
a j=l 


In this equation, f(x) and the M;(x) and N,(x) functions are known, and A is a 
constant. Interchanging the order of integration and summation gives 


N b 
y(x) = f(x) +A > Mj(x) } dx'N,(x')y(x'). (12.79) 
j=l . 


Now this integration has no x-dependence. If we define 


b 
Cj = [ dx' N,(x')y(x'), (12.80) 


Equation 12.78 can be rewritten as 


N 
y(x) = f(x) + AS > Mj(X)C;. (12.81) 


j=l 


Keep in mind, the constants C; are still unknowns because y(x) is unknown. 

Equation 12.81 is in a mixed form. The unknown elements are both the explicit 
y(x) and the C;’s. The known elements are f(x) and the M;(x) functions. Some terms 
have x-dependence and some terms are constants. We can generate a more consistent 
form if we operate on both sides of Equation 12.81 by 


b 
i, dx N(x), (12.82) 
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which yields 


b b N b 
fi dx N;(x)y(«) = i dx N(x) f(x) +A ) c, | dx N;(x)M,(x). (12.83) 
a a j=l a 


According to Equation 12.80, the integral on the LHS is just the unknown constant C;. 
The first integral on the RHS can be defined as a new, known constant with an i 
subscript: 


b 
Bi = | dx N;(x) f(x). (12.84) 
a 
The last integral generates the known elements of a matrix [A]: 
b 
Aij = i dx Nj(x)M j(x). (12.85) 
a 


Equation 12.83 can now be written in terms of known and unknown constants as 


N 
C= B+ AS) CyAj (12.86) 
j=l 
or in matrix notation as 
[C] = [B] + A[A][C], (12.87) 


where the matrices [B] and [C] can be viewed as column matrices made up of the 
elements of B; and C;. 
Working in matrix notation, and rearranging Equation 12.87 gives 
[B] = {[1] — A{A}}IC], (12.88) 
where [1] is the N X N diagonal, identity matrix. Solving for [C] gives 
[C] = {[1} — ALA}} "BI. (12.89) 
To determine the elements of [C] requires an inversion of the matrix 
[1] — A[A]}. (12.90) 
Once these elements of [C] are determined, y(x) is generated quite simply using 


Equation 12.81: 


N 
y(x) = f(x) +A $5 CiMj(x). (12.91) 
j=l 
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This matrix notation points out a very interesting result for these types of problems. 
If f(x) = 0, then all the elements of the [B] matrix are also zero. Therefore, from 
Equation 12.88, it can be seen that a nonzero [C] matrix, and therefore a nonzero 
y(x), can exist only if 


I{1] — ALAllaer = 0. (12.92) 


Given the [A] matrix, Equation 12.92 will be satisfied only for specific values of A. 
A nonzero solution to the original integral equation will exist only for these A, 
eigenvalues, and the solutions y,(x) are eigenfunctions of the integral equation: 


b 
i} dx' k(x, x') yn(x') = yal (12.93) 


Example 12.3 _ It is possible to separate kernels that do not appear, at first glance, 
to be separable. As an example, consider the translationally invariant kernel 


k(t, t') = cos(t ~ t’). (12.94) 


This kernel can be separated with the trigonometric identity 


cos(t — t') = cos(t) cos(t’) + sin(t) sin(t’), (12.95) 
so that 
2 
cos(t — f') = > MON, (C'), (12.96) 
j=l 
where 
M(t) = cos(t) M(t) = sin(t) 
é (12.97) 
N,(t') = cos(t’) No(t') = sin(t’) 
EXERCISES FOR CHAPTER 12 


1. Derive the Fredholm integral equation that corresponds to 
d*y(x) 
dx? 


subject to the boundary conditions y(1) = y(—1) = 0. How does the integral 
equation change if the boundary conditions are changed to y(1) = dy/dx|, = 1? 


2. Convert the following integral equation to a differential equation: 


y(x) = | "axe xx')y(x'). 
0 
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Identify all the boundary conditions included in the integral equation, which 
must be specified to make the solution to the differential equation unique. 


3. Given the Fredholm equation of the second kind, 


1 
els / dt K(x,1)y(2), 
-! 


where 


_ fad-xa+1)/2 -l<t<x 
De esis ae 2 lie 


obtain the associated second-order differential equation and the boundary con- 
ditions. 


4. Convert the integral equation 
yx) =1l—x+ [ dt (x — tyy(t) 
0 


to a differential equation, with the appropriate boundary conditions. 


5. Convert the integral equation 


p(x) = 14 A? fa (x — Hod(t) 
0 


to a differential equation and determine the appropriate boundary conditions. 


6. Use Fourier techniques to solve the following Fredholm integral equation for 


P(x): 
er = [. dt p(tye &-, 
7. Use Laplace techniques to solve the following integral equation for (x): 
P(x) = x — ie dt (t — x)(x). 
8. Using the Neumann series approach, solve 


d(x) =1- 2 [dro 
0 


to obtain d(x) =e. 
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9. Consider the Fredholm equation 


ox) =A ij dy [1 + sin(x + y)]6(y). 


—T 


(a) Using the method of separable kernels, determine the values for A which 
allow solution for d(x). 
(b) Determine all the nonzero solutions for d(x). 


10. Consider the differential equation 
dy(x 
* + d2x’y(x) = 0, 


with the boundary condition y(0) = 1. 

(a) Convert this differential equation and its boundary condition to an integral 
equation. 

(b) Using the Neumann series approach, obtain a solution to the integral equation 
assuming small A,. Can you place this solution in closed form, i-e., not in 
terms of an infinite series? 


11. Solve 
1 
d(x) =x+ | dt (1+ xt)d(t) 
0 


by three methods, 

(a) the Neumann series technique. 
(b) the separable kernel technique. 
(c) educated guessing. 


12. Find the possible eigenvalues and the eigenfunction solutions of the following 
integral equations: 


1 
i. d(x) =A / dt (x — t)* (x). 
-1 
1 
ii. d() = af dt (t — x)(t) 
-1 
1 


lil, d(x) = af dt (xt + x)d(t). 
-1 


1 
iv. o(x) = af dt (1 + xt)d(t). 
0 
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ADVANCED TOPICS IN 
COMPLEX ANALYSIS 


This chapter continues our discussion of complex variables with the investigation 
of two important, advanced topics. First, the complications associated with multi- 
valued complex functions are explored. We will introduce the concepts of branch 
points, branch cuts, and Riemann sheets, which will allow us to continue to use the 
residue theorem and other Cauchy integral techniques on these functions. The second 
topic describes the method of steepest descent, a technique useful for generating 
approximations for certain types of contour integrals. 


13.1. MULTIVALUED FUNCTIONS 


Up to this point, we have concentrated on complex functions that are single valued. 
A function w(z) is a single-valued function of z if it generates one value of w for a 
given value of z. The function 


w=Z (13.1) 
is example of a single-valued function of z. In contrast, the function 

w = 2/2 (13.2) 
is an example of a multivalued function of z. This is easy to see, because z = 1 
generates w = +1 or w = —1. Another point, z = 2i, gives either w = V2e!*/* of 


w= V2 ein/4, 

In this chapter, we will use the convention that the vr operation is performed 
only on a positive real quantity to produce a single, positive, real result. Operations 
like ()'/? can have positive, negative, or complex arguments and produce multivalued 
results. 
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13.1.1 Contour Integration of Multivalued Functions 


In Chapter 6, special techniques were derived for performing integrals in the complex 
plane. Can we still use the residue theorem and the other Cauchy integral techniques 
on these multivalued functions? The answer is yes, if we are careful! 

From our previous work, we know that if the complex function w(z) is analytic 
inside a closed contour C, we can write 


f deme =0. (13.3) 
Cc 


The key requirement here is that the contour must be closed. For this path to be closed, 
the variable of integration z has to return to its starting value. For a single-valued 
function, this obviously also brings w(z) back to its starting value. The same is not 
always true with a multivalued function, and this is the source of some difficulties, 
as you will see in the two examples that follow. 


Example 13.1 The single-valued function 
wH=r (13.4) 


is analytic everywhere in the complex z plane. Remember, for a function to be 
analytic, we must be able to write w = u(x, y) + iv(x, y), with u(x, y) and v(x, y) 
being continuous, real functions that obey the Cauchy-Riemann conditions. In this 
case, 


u(x, y) = x* — y* 


v(x, y) = 2xy (13.5) 
and 

au _» — 

Ox 

er ono 

rs oy" 


So the Cauchy-Riemann conditions are satisfied for all z. If the closed contour C 
is the unit circle centered at z = 0, as shown in Figure 13.1, then according to the 
Cauchy integral theorem, 


f ded =0. (13.7) 
c 


Let’s be skeptical about this last result and explicitly perform the integration of 
Equation 13.7. On the contour C, z = e!® and dz = ie'®d@. To go around the circle 
once and return z to its initial value, we let the range of @ vary from 0 to 277. We could 
have made the limits on 6 different, perhaps 7 to 37r, but the starting and ending 
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Cc z-plane 


Figure 13.1 Contour for Unit Circle Integration 


values of @ must differ by 277 so the circle is traversed exactly once. The integral 
becomes 


= (1/3)(e" ~ 1) 
= 0, (13.8) 


which is exactly the expected result. The integral around a closed path of this analytic 
function is zero. There are no surprises here. 


Example 13.2. Now consider the multivalued function 
w= 23/2, (13.9) 


The single point z = 1 will give two different values for w: w = 1 andw = —1.A 
simple way to see this is to write z using an exponential representation 


z=1=¢% =e?" = e4..-, (13.10) 
Notice that all these expressions identify the point z = 1, but do so using different 


polar angles. Now use the fact that (e'®)3/2 = ¢3°/2 to obtain the corresponding values 
of w: 


ei? — elon = ella7.. =] 
w= or ; (13.11) 
et = eft = eilsa. .e = —] 


Two different values for w are generated by this process. 
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Is this function analytic? If we write z = x + iy, we have 
w= (x + iy”, (13.12) 
and with w = u(x, y) + iv(x, y), 
wiry) = 500 tw) = 5 [e+ oy? + | 
v(x, y) = zw -w)= = (« + iyy/? — (x - iy]. (13.13) 


The partial derivatives become 


m= Flat oy? + @- 7] 
5" + [@ + y'? - @- 97] 
2 = Fla +? -@- 07] (13.14) 
S = Gat? + @-y4], 


and so the Cauchy-Riemann conditions are satisfied for all z. This means the function 
given in Equation 13.9 is analytic everywhere in the z-plane, and consequently its 
closed line integral around the unit circle shown in Figure 13.1 should be zero: 


face” = 0. (13.15) 
c 
Again, let’s be skeptical about this result and check it explicitly by performing 


the same type of integration worked out in the previous example. As before, we set 
z= e9, dz = ie'®d@, and let 6 range from zero to 27: 


2a 2a 
| de ieee | de ie!99/2 
0 0 


Qa 
= 2 4150/2 
> ‘ (13.16) 
= ie ah 1] 
_ 4 
a 


Uh-oh! The Cauchy integral theorem appears to be violated. Our function is analytic 
inside C, but the integral is nonzero. Because the Cauchy integral theorem is the 
basis for complex integration using the residue theorem, its violation would be a big 
problem. Fortunately, there is a way to preserve the validity of the theorem. 
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Notice, although the 0 to 277 range of @ returns z to its starting value in the above 
integration, it does not return w to its initial value. When @ = 0, z = e = 1 and 
w = 1, but when 0 = 2a, z = e” = 1 and w = e?” = —1. But if we go around 
the unit circle twice, that is 0 < @ < 4s, both z and w(z) = z°/? will return to their 
starting values, and we obtain 


Aq An 
[ de ietcenp?= f dé je'5 8/2 
J0 0 
2 4 
a £ 2150/2 
5 


(13.17) 
0 


Zz... 
al = 1] 
=0. 


It appears that the definition of closed integral contour must take on new meaning 
when we are using multivalued functions. We cannot say a contour is closed unless 
both the variable of integration and the function being integrated simultaneously 
return to their initial values. We will investigate this idea in detail in the sections that 
follow. 


13.1.2 Riemann Sheets, Branch Points, and Branch Cuts 


Branch points, branch cuts, and Riemann sheets are all concepts developed so that 
the Cauchy integral theorem, the residue theorem, and all the other tools developed 
for analytic functions will still work with multivalued, complex functions. 


Riemann Sheets The nature of multivalued functions can best be explained by 
looking at their mapping properties. Consider the simple function, 


w(z) = 2/2, (13.18) 


It is easy to see this function is multivalued. The single point z = e'7/4 = e/97/4 maps 
onto two different points in the w-plane, w = e'7/8 and w = e97/8. This mapping is 
shown in Figure 13.2. 


z-plane w-plane 


Figure 13.2 Mapping of a Multivalued Function 
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1/2 


Figure 13.3 Mapping of w = z 


Let’s look at the mapping in more detail by determining how each of the points on 
a unit circle, centered around the origin of the z-plane, gets mapped onto the w-plane. 
On the circle, z = e”® and w = e'8/2 = e'*. This mapping is shown, divided into 
three steps, in Figure 13.3. At the start, we set 8 = 0,so z = 1 and w = 1. In the first 
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pair of pictures, we see that as the polar angle @ increases from 0 to 77, @ increases 
from 0 to 7/2. So w moves along its own unit circle, lagging z in angle by a factor 
of two. In the middle pair of pictures, @ continues to increase from 7 to 27, so that 
zreturns to z = e?” = 1, its initial value. However, w ends up at w = e'” = —1, 
because ¢ has only increased to 77. In the last pair of pictures, @ continues to increase 
from 277 to 477, repeating the loop it has already made, and once again returns to its 
initial value of z = e'4” = 1. At the same time, w finally makes a complete circle 
and returns to its initial value, w = e?” = 1. With the interpretation that a closed 
contour in the z-plane must return both z and the function w(z) to their starting values, 
the unit circle in the z-plane must be traversed twice in order to form a closed path 
for the function w = z!/2. 

To facilitate contour integration and other tasks that involve multivalued functions, 
it is useful to extend our concept of the complex plane to include what are commonly 
called Riemann sheets. Simply put, this is a way to force the mapping of a multivalued 
function to be one-to-one, so we can keep track of what contours are truly closed. 
Imagine, in the case of the function w = zi/ 2, the complex z-plane is composed of 
two different “sheets,” as depicted in Figure 13.4. The contour that z follows in Figure 
13.3 can be viewed not as the same circle twice, but one extended circle, which covers 
both sheets. In the z-plane, as 6 is increased from zero to 27, a circle is traced out 
on the upper sheet in the z-plane and maps to the upper half circle in the w-plane, 


w-plane 


Multisheeted z-plane 
Figure 13.4 Multisheeted z-Plane 
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w-plane 


Multisheeted z-plane 
Figure 13.5 Mapping of the Unit Circle with a Muitisheeted z-Plane 


as shown in Figure 13.5. Once @ is increased beyond 277, suddenly z “jumps” to the 
second sheet in the z-plane. Consequently, the z-plane contour has not closed on itself 
at this point. As @ is increased from 27r to 477, now tracing out a circle on the lower 
z-plane sheet, w traces out the lower half circle in its plane, as shown in Figure 13.6. 
Finally, once @ reaches 477, z jumps back to the upper sheet and the circular paths in 
both planes are complete. 

The picture, in Figure 13.2, of how a single point gets mapped from the z-plane to 
the w-plane is also modified when we are using Riemann sheets. The point z = ein/4 
is located on the upper Riemann sheet of the z-plane and maps into the point w = eit/8 
in the w-plane. The point z = e'9"/4, which is located on the lower Riemann sheet, 
maps onto the other point w = e7/8 of the w-plane. In this way the mapping is 
one-to-one, as shown in Figure 13.7. 

Notice that a one-to-one mapping has been accomplished with only one sheet for 
the w-plane. This works because the inverted function z = w” is not a multivalued 
function of w. In more complicated functions, multiple Riemann sheets may be 
needed in both the z- and w-planes. 


Branch Points In the analysis of the previous section, we demonstrated the ne- 
cessity of having two Riemann sheets by determining how the points along a closed 
contour in the z-plane were mapped on the w-plane. We will use this method so often 
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that connects the two branch points. It is for this reason that branch points must occur 
in pairs. They provide the end points for the branch cuts. If there is only a single 
branch point in the finite complex plane, it must be coupled with one at infinity. More 
will be said about relocating branch cuts in the examples that follow. 


13.1.3 Applications of Riemann Sheet Geometry 


Now that the fundamentals of branch points and branch cuts have been introduced, 
the details of Riemann sheet geometry are probably best understood by looking at 
examples of specific functions. The examples that follow discuss increasingly more 
complicated multivalued functions. In each case, we determine the location of the 
branch points and the number of Riemann sheets needed for a one-to-one mapping, 
locate the branch cuts, and make the Riemann sheet connections. 


Example 13.3 Consider the Riemann sheets associated with the function w = z'/?. 
For this function, each value of z generates three different values of w, so it’s likely 
that there must be three Riemann sheets to describe the z-plane. A tour in the z plane 
shows that z = 0 is a branch point, and it must be circled three times before w = zi/3 
returns to its starting value. This confirms that there must be three z-plane sheets. 


b 
a b 
ae: 
d 
c d 
c 


Figure 13.14 Riemann Sheet Connections for w = z' 
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that we give it a special name. The paths used to determine the multivalued nature 
of a function will be referred to as tours. With the function w = zif 2 we discovered 
a single tour around the origin returned z to its initial value, but did not return w 
to its starting value. Two successive tours, however, brought both z and w back to 
their initial values. This is why we needed two Riemann sheets in order to make our 
mapping one-to-one. 

Still using the function w = z!/*, imagine another tour, this time not around the 
origin, but around another point in the first quadrant, as shown in Figure 13.8. We 
label the starting point for the tour as P, with coordinates r, and @,. AS we move 
around this circle once, z = r e’® clearly returns to its initial value. But notice, w also 
retums to its starting value as well, because both r and @ have returned back to their 
initial values. This will be the case for any tour of this function that does not surround 
the origin. For this particular function, the origin appears to be a very special point. 

Points of this type, those around which a single tour does not return the function 
to its initial value, are called branch points. Branch points are isolated by reducing 
the radius of the touring circle to an arbitrary, small value. Usually the first task in 
making sense out of any multivalued complex integrand is to determine the locations 
of all its branch points. 

As you will see in the next section, branch points always occur in pairs. We 
have seen that the function w = z!/? has a branch point located at the origin in the 
complex z-plane. Tours with this function, like the one in Figure 13.8, show that 
there are no other branch points of w = z!/? in the finite z-plane. The second branch 
point for this function must be located at infinity. It may seem strange to call infinity 
a single point, but we can do so with the aid of the interesting mapping shown in 
Figure 13.9. This figure shows how points in the complex plane can be mapped onto 
points on the surface of a sphere of unit radius. The complex plane passes through 
the center of the sphere. To determine the point on the sphere that corresponds to 
a given complex number, draw a line from N, the north pole of the sphere, to the 
point in the complex plane. Where this line intersects the sphere is the mapped point. 
Points in the complex plane that lie outside the intersection of the sphere and the 
plane, such as the point P, shown in Figure 13.9, map onto the northern hemisphere. 


| Cc z-plane 


av «8 x 


Figure 13.8 


Displaced Unit Circle Contour 
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Figure 13.9 Mapping of the Complex Plane onto the Surface of a Unit Sphere 


Points inside this intersection, such as P2, map onto the southern hemisphere. Notice 
how the origin of the complex plane maps to the south pole of the sphere, while any 
infinity in the complex plane, that is +0, —00, +ic0, —ioo, etc. maps to the north pole. 
This single point at infinity is the second branch point of w = z'/?. With this picture, 
a small circular tour around the origin of the z-plane is equivalent to a small circular 
tour around the south pole. A tour around an infinitely large circle in the z-plane is 
equivalent to a small circular tour around the north pole. 


Branch Cuts The two Riemann sheets associated with the function z'/? have been 
described in Figures 13.4-13.7. The picture for the Riemann sheet geometry is not 
complete, however, until the sheets have been connected. This connection occurs 
where tours jump from one sheet to the other. The tour described in Figures 13.5 and 
13.6 was on a circle of constant radius. As the radius of this circle is changed, it can 
be seen that the jump between sheets occurs along a line that falls on the positive 
x-axis. This line forms what is referred to as a branch cut and is shown in Figure 
13.10. It is called a cut, because you can imagine taking a pair of scissors and actually 
cutting the z-plane sheets along this line to prevent a contour from crossing it. It can 
be seen that the cut goes between the two branch points of z'/2. It is made on each 
sheet of the z-plane, along the same line, as shown in Figure 13.11. The edges of the 
cut on one sheet have been labeled a-b and the edges of the other sheet have been 
labeled c-d. These edges are then connected as shown in Figure 13.12. As tours cross 
the branch cut they follow the connection and switch sheets. 

Figure 13.12 is quite clever because it simultaneously shows the nature of the 
z-plane from two different points of view. If the two z-plane Riemann sheets are 
stacked and viewed from the top, the point z = eit/ 4 on the top sheet, is exactly 
above the point z = e’°"/*, on the bottom sheet. Consequently, from this point of view 
these two points appear as the single point z = 1/ /2 + i/,/2. But when the sheets 
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Figure 13.10 A Branch Cut in the z-Plane 


Figure 13.11 Cut Riemann Sheets 
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Figure 13.12 Riemann Sheet Connections 


are viewed separately, z = 1/ 2 +if V2 becomes two distinct points, one on each 
of the z-plane Riemann sheets. With many functions, however, making a drawing 
like Figure 13.12 becomes impossible. In these cases, the more abstract method of 
indicating the connections shown in Figure 13.13 is more convenient. 

For the function zi/ 2 we have placed the branch cut along the positive x-axis, 
between the branch points at the origin and at infinity. The arguments associated with 
Figures 13.5 and 13.6 led to this conclusion. With a little thought, it should be clear 
that the jump between sheets, and therefore the location of the branch cut for this 
function, does not have to be along the positive x-axis, but could be along any line 
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Figure 13.13 Another Way to Show Riemann Sheets Connections 
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that connects the two branch points. It is for this reason that branch points must occur 
in pairs. They provide the end points for the branch cuts. If there is only a single 
branch point in the finite complex plane, it must be coupled with one at infinity. More 
will be said about relocating branch cuts in the examples that follow. 


13.1.3 Applications of Riemann Sheet Geometry 


Now that the fundamentals of branch points and branch cuts have been introduced, 
the details of Riemann sheet geometry are probably best understood by looking at 
examples of specific functions. The examples that follow discuss increasingly more 
complicated multivalued functions. In each case, we determine the location of the 
branch points and the number of Riemann sheets needed for a one-to-one mapping, 
locate the branch cuts, and make the Riemann sheet connections. 


Example 13.3 Consider the Riemann sheets associated with the function w = z'/?. 
For this function, each value of z generates three different values of w, so it’s likely 
that there must be three Riemann sheets to describe the z-plane. A tour in the z plane 
shows that z = 0 is a branch point, and it must be circled three times before w = zi/3 
returns to its starting value. This confirms that there must be three z-plane sheets. 
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Figure 13.14 Riemann Sheet Connections for w = z' 
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This function is so similar to w = zi/ 2 that it is fairly clear that the only other branch 
point is at infinity. There is a single branch cut connecting these two branch points. If, 
as before, the cut is made along the positive real axis, the connections of the Riemann 
sheets are as shown in Figure 13.14. If you start on the top sheet (sheet #1), a single 
loop around the origin drops you to the second sheet. A second tour drops you to the 
third sheet. A third loop returns you to the first sheet, and back to your starting value 
of w. This example points out the usefulness of the simple dot connection diagram 
for indicating sheet connections. It would be very difficult to make a drawing like 
Figure 13.12 for this function. 


Example 13.4 As the next example, consider the function w = Inz. This function 
is more conveniently handled with polar representation: z = re’? and w = Inr + i6. 
From this form, it is clear that all tours in the z-plane around any point other than the 
origin will return w to its initial value. A tour around the origin, however, will not. To 
see this, start a tour around the origin at some initial point with r = r; and @ = @;. 
The initial value of the function is w = Inr; + i6;. After a single lap around the origin, 
w = Inr; + i(0; + 27). After two laps, w = Inv; + i(@; + 477). No matter how many 
complete tours are made, w = |nz will never return to its initial value! Consequently, 
for this function, there are an infinite number of z-plane Riemann sheets. If the sheets 
are cut along the positive real axis, they are connected as shown in Figure 13.15. 


Example 13.5 Now consider the function 
w = (g ~ a’)'?, (13.19) 


where a is a positive real number. This example is quite a bit more complicated, 
and requires the use of multiple complex phasors to check for branch points. One 
way to approach a function of this nature would be to break it up into two or more 
simple intermediate functions. For instance, in this case you might let t = z* — a* 
so w = t!/2, Then you would analyze the two functions separately and combine 
the results. For this example, however, it is more instructive to work with the entire 
function directly. It does help to factor Equation 13.19 into two terms: 


w = (z- a)!?¢z + a)”, (13.20) 


Our first task is to find all the branch points in the z-plane. To accomplish this, we 
must perform lots of tours to determine where w does not return to its initial value. It 
is useful to define two phasors for the complex quantities z — a and z + a, as shown 
in Figure 13.16. These phasors are, in exponential notation, 


zta=nr,e% 


z-a=ne®, (13.21) 
With these definitions, w can be written as 


w= (ne)! (re%)!% Se 


Figure 13.15 Riemann Sheets for w = Inz 


z-plane 
y z 


—a : a 


Figure 13.16 Phasors for Touring in the z-Plane 
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y z-plane 


Figure 13.17 Tour Around z # +a 


or 


w= fri v/rp elt 60/2, (13.23) 


For the first tour, pick any point P, which is not one of the two points +a, as shown 
in Figure 13.17. At the start of the tour, z = z,, with the corresponding initial values 
for the phasor components being r1;, 0);, f;, and @;. A single counterclockwise 
tour around P returns both z and all these phasor components to their initial values. 
Consequently, w also returns to its initial value, and P is not a branch point of 
w= (2 = a’)'/2, 

Now let’s tour around the point z = a, again defining the starting position of the 
tour to be z = z,, as shown in Figure 13.18. The initial values for the phasors are 
again r\;, 91;, 72;, and @;. As z makes a counterclockwise loop around this point and 
returns to its initial value, 7}, 0,, and r, all return to their initial values, but this time 
02 has increased to 6; + 277. Consequently, w after one loop is 


w= lr; Toi eit 5+20)/2 


=~ Sri yf rx ef Out nd/2, (13.24) 


a. z-plane 
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Figure 13.18 Tour Around the Point z = a 
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which is a factor of —1 different from its initial value. This shows z = a is a branch 
point of w = (z2 — a”)'/2. A tour around the point z = —a produces a similar result, 
so the two points z = +a are both branch points of w. 

We have already shown there are no other branch points in the finite z-plane, but 
there is the possibility that these two branch points do not form a pair with a single cut 
between them. Instead, each of the branch points might have a separate cut extending 
out to infinity. In this case, you can visualize each of the two branch points paired 
with a different branch point at infinity. An easy way to determine if this is actually 
happening in this case is to make a large tour that encloses both branch points, as 
shown in Figure 13.19. If, at the end of this tour, we are on a different sheet from the 
start, we must have crossed some cut(s) extending to infinity. After performing this 
big tour, notice both r; and r, have returned to their initial values, but both 6, and 62 
have increased by 277, so w becomes: 


w= Vr Poi eiOut2a+t Oj +2m)/2 
= J ryin/ry ent 0? (13.25) 


which is exactly the initial value of w. We did not have to cross a cut, so the two 
branch points at z = +a must form a pair with a single cut between them. 

To determine the complete nature of the sheets we need to know the results of a 
few more tours. It is easy to show that touring around either of the branch points at 
z = <a twice returns w to its initial value. Also, going around one of the branch 


Figure 13.19 Tour Around Both Branch Points 
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Figure 13.20 Riemann Sheet Connections 


points once and then back around the other, in a figure-eight pattern, also returns w to 
its initial value. These facts lead us to conclude there are two Riemann sheets in the 
z-plane, and they are connected with a cut between them, as shown in Figure 13.20. 

This figure shows the cut placed along a straight line between z = +a. As we 
mentioned earlier, the cut can be placed anywhere as long as it terminates at the 
appropriate branch points. We could legitimately deform the cut, in successive steps, 
as shown in Figure 13.21. The configuration of the cut at the bottom of this figure 
makes it look like there might now be two branch cuts. This is not the case. It is 
actually the same cut making a very large loop around the complex plane. With the 
cut positioned as shown at the bottom of Figure 13.21, a tour enclosing both branch 
points crosses the branch cut twice. Because this is the same cut, however, the second 
crossing of the cut returns us to the original sheet. 


Example 13.6 As a last example, consider the function 
w = (z~-a)'? + (z+a)!”. (13.26) 


Although this looks very similar to the function in the previous example, it behaves 
quite differently. To find the branch points, tours must be made in the z-plane. The 
same phasors that were used in the previous problem, and shown in Figure 13.16, 
can be used again here: 

zta=ne% 


z-a=ne™, (13.27) 

With these phasors, the function can be rewritten 
w= fre? + fr ei®/?, (13.28) 
In a single, counterclockwise lap around z = a, r;, 6; and rz return to their initial 


values. The angle 62, however, increases by 27. For simplicity, take the initial values 
of 6, and @, to be zero, and the initial radial values to be r; = rj; and r, = r;. Then 
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Figure 13.21. Deformation of the Branch Cut 


at the start of this tour, 


w= frit Jnr (13.29) 


but at the end 


w= ry Tj. (13.30) 
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A tour around z = ~a produces a similar result, so both points at z = a are branch 
points. Single loop tours around any other point in the finite z-plane will return w to 
its starting value, so there are no other branch points in the finite z-plane. 

As was the case in the previous example, we must check to see if there are any 
branch points at infinity by making a tour around both z = a and z = ~—a. Start the 
tour with both @; and 6 equal to zero. At the end of this tour, r; and r) will return to 
their initial values, but both @, and 9 will increase by 27. Thus, at the start of this 
tour 


w= Jn t Sri, (13.31) 


while at the end 


=~ Sri - Sra. (13.32) 


Consequently, there must be two branch points at infinity, and the branch cuts that 
start at z = a and z = —a extend to infinity, as shown in Figure 13.22. Unlike the 
single cut, with the misleading appearance of two cuts, shown at the bottom of Figure 
13.21, these really are two distinct cuts extending out to infinity. 

We can determine the total number of Riemann sheets and how they are connected 
at the cuts by performing a few more tours, and watching how many different values 
of w are generated at the same point of z. Consider the three different tours shown 
in Figure 13.23. These tours all start with the same value of z. This value, which we 
will call z, = r; e’*, is a pure real quantity, lying just above the cut with 6; = 0 and 
to the right of z = a. In terms of the phasor parameters, at the start of each of these 
tours 


w= Jn t JSri- (13.33) 


We already determined that at the end of the topmost tour 


w= Jr — fri (13.34) 


z-plane 


Figure 13.22 Branch Points and Branch Cuts for w = (z — a)!/? + (z + a)? 
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Figure 13.23 Tours to Determine the Number of Riemann Sheets 


which is different from the initial value, so at the end of this tour we must be on a 
second Riemann sheet. Now consider the tour shown in the middle of Figure 13.23. 
After one lap around this path, only 6, has changed by 277, and so 


we Sri t Sra 


(13.35) 
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This is a third value for w, different from both Equation 13.34 and 13.35, so at the 
2nd of this tour, we must be on a third Riemann sheet. A single lap around the tour 
shown on the bottom of Figure 13.23 changes both 6, and @ by 27. At the end of 
his tour 


w= —J/ni- Sra. (13.36) 


This is a fourth value for w and consequently requires a fourth Riemann sheet. 

Any combination of these tours or multiple laps of a single tour will not generate 
any new values for w, and so we conclude that there are four Riemann sheets. Because 
a two-lap tour around either of the branch points at z = +a would return us to the 
original sheet, but a figure-eight pattern would not, the connection diagram must be 
as shown in Figure 13.24. 


Figure 13.24 Riemann Sheet Connections 
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A quicker way to see that there should be four Riemann sheets for this function is 
to realize that the + property of the square root operation, i.e., 


w= ti /ntJ/n, (13.37) 


allows four different possible combinations for forming w. This method is useful for 
determining the number of sheets, but does not guide you in determining how they 
are connected. In our previous example, with w = (z — a)!/2(z + 2)'/?, the square 
roots have only two combinations for w, 


w= ti /nJn, (13.38) 


and so there were only two Riemann sheets associated with this function. 


13.1.4 Riemann Sheet Mapping 


Quite often, it is useful to see how an entire Riemann sheet gets mapped from one 
plane onto another. This is quite easy for the function w = z'/?. If we define z = r,e 
and w = r,e'?, we can write 


ta = aft (13.39) 
d= > (13.40) 


According to Figure 13.10, we are on the first Riemann sheet if 0 < 6 < 27. 
Therefore, the values of w we can generate from the first sheet will be limited to the 
angular range 0 < @ < 7, or equivalently the upper half plane. Likewise, the second 
Riemann sheet will map onto the lower half plane of the w-plane. This situation is 
shown in Figure 13.25. 

Now remember, we said the actual placement of the branch cut is arbitrary, as 
long as it connects the branch points. For this function, the branch cut could equally 
well have been placed along the positive imaginary axis of the z-plane. In that case, 
the z-plane sheets would map onto the w-plane as shown in Figure 13.26. Now the 
first Riemann sheet is associated with the range 77/2 < @ < 57/2, and the second 
sheet with 52/2 < 0 < 92r/2. In the w plane, the dividing line has been rotated by 
at/4. Keep in mind, there is nothing that says the branch cut has to be a straight line. 
It could be placed as shown in Figure 13.27. In this case, it is impossible to define 
the different Riemann sheets of the z-plane using a simple range of 6. Don’t make 
the common mistake of assuming a Riemann sheet is always defined by an angular 
range. 

It is possible that, in order to get a one-to-one mapping, there must be multiple 
sheets in both the z- and w- planes. Consider the function w = z*/?. We investigated 
this function briefly at the beginning of this chapter. We showed there were two 
values of w for a single value of z, so there must be two z-plane Riemann sheets. 


i 
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Top sheet of the z-plane 
Figure 13.27 Arbitrary-Shaped Branch Cut 


Quite similar to the square root function, tours in the z-plane reveal two branch points, 
one at z = 0 and one at z = ©. In this case, however, the inversion of this function 
z = w* is also multivalued. Each value of w produces three different values of z, so 
there are three Riemann sheets in the w-plane. Tours in the w-plane show that there 
are two branch points, one at w = 0 and one at w = ©, If the cuts in both planes are 
taken along the positive, real axes, the z-plane sheets map onto the w-plane sheets, as 
shown in Figure 13.28. The branch cut in the z-plane maps onto the real axis of the 
second Riemann sheet of the w-plane. 


13.1.5 Integrals of Multivalued Functions 


Remember, the primary reason we have spent all this effort defining branch points, 
branch cuts, and Riemann sheets is so we can use the Residue theorem and other 
Cauchy techniques to perform integrals on multivalued complex functions. 

Consider a contour integral in the complex z-plane of our favorite multivalued 
function: 


i dzz'/?, (13.41) 
Cc 


From our previous experience with this function, we know there are two Riemann 
sheets, and two branch points in the z-plane, one at z = O and the other at z = ~. 
For purposes of this discussion, let C extend across the lower half of the first sheet 
of the z-plane, so that the contour looks as shown in Figure 13.29. The integrand in 
Equation 13.41 is analytic everywhere except at z = 0, and so the contour C can be 
deformed without changing the value of the integral, as long as it is not moved across 
z = 0. Let’s attempt to move it up, into the upper half plane. As the contour is moved 
up it is snagged by the nonanalytic point at z = 0. But that is not all that happens. 
The left half of the contour, along which the real part of z is less than zero, moves up 
with no problem. But as the right half of this contour is moved across the positive real 
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Figure 13.30 Contour Deformation with Two Riemann Sheets 


axis, the deformed contour runs into the cut and must switch to the second Riemann 
sheet, as shown in Figure 13.30. 

Actually, it is possible to deform C and stay on a single Riemann sheet. Because 
the location of the branch cut is arbitrary, as long as it still connects the branch points 
we can modify it to extend along the positive imaginary axis, as shown in Figure 
13.31. Now as the contour C is raised into the upper half z-plane, it is deformed as 
shown in Figure 13.32. 
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Figure 13.31 Integration with Repositioned Branch Cut 
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Figure 13.32 Contour Deformation Remaining on a Single Riemann Sheet 
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Figure 13.33 Integration Along the Branch Cut and Around the Branch Point 


Continuing with this second approach to the deformation, consider the portion 
of this integration shown in detail in Figure 13.33. It is tempting to claim that the 
two vertical portions of this contour can be made to cancel each other, because they 
follow the same line, but in opposite directions. This would be the case if it weren’t 
for the branch cut, which causes the function z!/? to have different values on either 
side. To see this, let 


z= re’, (13.42) 


with 8 = 0 being on sheet #1. Along the vertical segment just to the right of the 
branch cut, 9 = 7/2 and 


yf _ Vr eit/4 = yVr/2(1 +i) 0<r<~o, (13.43) 
Along the vertical segment just to the left of the branch cut, 9 = —37/2 and we have 
Zi? = Jr eal = —V/r/2(1 +i) o>r>d. (13.44) 


The value of z'/? on the left-hand side of the cut is not equal to its value on the 
right-hand side, and so the integrals along the vertical sections do not cancel. 

We should also look carefully at little circular part of the integral around the 
branch point in Figure 13.33. To perform this integration define z = ee’®. Then as 
the circular contour is shrunk around the branch point, 


7? = lim Je ei8/? —3n/2<0< 2/2. (13.45) 
€— 


Consequently, as « —> 0, the integrand also goes to zero and the contribution from 
this circular segment is zero. In general, however, the integral around a branch point 
will not be zero and its contribution must be carefully evaluated for each particular 
situation. 

These ideas are developed in the two examples that follow. 
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Figure 13.34 Complex Plane Picture for Integration of Example 


Exampie 13.7 As an example of how contours involving multivalued functions can 
be deformed in the complex plane, consider the integration 


2/2 
[a Gn" (13.46) 


where z, is some point in the complex plane, and C is the straight line contour 
on the first sheet, shown in Figure 13.34. The multivaluedness of the integrand is 
controlled by z!/? in the numerator. The integrand can, as before, be represented on 
two Riemann sheets with branch points at z = 0 and z = ©, and a single branch 
cut between them. The addition of the term in the denominator does not change the 
arrangement of branch points or the nature of the Riemann sheets, but it does add 
a nonanalytic, second-order pole on both sheets, at z = z,: The branch cut is taken 
along the positive imaginary axis in anticipation of the contour deformation we are 
about to perform. 

A clever transformation of this integral can be arrived at by deforming C into the 
upper half plane as far as possible, while trying to stay entirely on the first Riemann 
sheet. As we do this, the contour “snags” on both the branch point and the pole at 
z,- This situation is shown in Figure 13.35. As the imaginary part of the horizontal 
sections of C’ approach +i, their contribution to the integral vanishes, because the 
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Figure 13.35 Deformed Integration Contour 


MULTIVALUED FUNCTIONS 539 


Sheet #1 


Figure 13.36 Final Contours for Evaluating Integration 


integrand is approaching zero at a rate proportional to r~3/2, The two vertical sections 


extending down to z, also cancel, because the integrand has the same value on these 
segments and they are in opposite directions. From our previous discussion, however, 
we know the integration along the vertical sections on either side of the branch cut do 
not cancel, and the contribution around the branch point must be evaluated carefully. 
The contour shown in Figure 13.34 is thus equivalent to the sum of the two contours 
shown in Figure 13.36. 


Example 13.8 Now let’s work a specific, practical problem. Consider the evaluation 
of the real integral 


7 1 
= —_—-—_. 47 
1 er aresy (13.47) 


Remember our convention that the Vv symbol always gives us the positive root, so 
we should obtain a positive result for this integral. 

Our first step is to convert this integral into one in the complex plane by substituting 
z for x, and then determining an appropriate contour. The integral can be written as 


1 
I= [amex 7 (13.48) 


1/2 


The integrand is multivalued because of the z'’~ term, and so there are two Riemann 
sheets, the branch points at zero and infinity, and the single branch cut between them. 
We place the branch cut on the positive real axis. To properly correspond to the real 
integral, the complex contour C must also run along the real axis, directly parallel to 
the cut, on one of the two sheets. To determine which one, let 


xX Z= re? (13.49) 
Along C, we must have either @ = 0 (on the first sheet) or 6 = 27 (the second 
sheet). But notice if 9 = 277, we get the negative root in the denominator, which we 


explicitly said we did not want. Thus we take the contour to be on the first sheet, as 
shown in Figure 13.37. 
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Figure 13.37 Starting Contour for Integral Evaluation 


We are left with evaluating the integral in Equation 13.48 along the path shown in 
Figure 13.37. This integration can be evaluated by considering integration around the 
cleverly closed contour C, on sheet #1, shown in Figure 13.38. This contour is indeed 
closed, because we have made a special effort to stay on the first sheet by avoiding 
the branch cut entirely. The integration around C, is easy to evaluate, because the 


integrand is analytic everywhere inside C, except at z = —1. Therefore, 
d oa ea Qa : =2 13.50 
5 2URG + 1) a mR oes (13.50) 


z=e'™ 
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Figure 13.38 Equivalent Closed Contour Integration 
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Figure 13.39 The Four Parts of the C, Contour 


It is important to notice that the residue was evaluated at z = e!” and not at z = e3”, 
because the integration is being performed on sheet #1. 

The problem now becomes one of relating the outcome of this integration back to 
the original integral of Equation 13.48. The integration around C, can be broken up 


into four parts: 
gef+fefef (13.51) 
5 1 J2 J3 J4 


where the paths 1-4 are shown in Figure 13.39. Along path 1, @ = 0, z = re’, and 
dz = dr so 


1 
= gs ee 13.52 
i | a5 ae 


The integral along path 1 is the same integral as in Equation 13.47, the very thing we 
hope to determine. Along path 2, we are moving around a circle of constant radius 
R, soz = Re'® and dz = iRe'°d6, with @ ranging from 0 to 27. As we take the limit 
as R goes to infinity, path 2 contributes 


2m ef 
= ml 
I Rom aa +1) 


(13.53) 
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Along path 3, @ = 27, while r ranges from infinity to zero, so z = re®?™ and 
dz = re’*"dr. The integral on path 3 becomes 


ett 
13.54 
[- [« " Vrem(re®™ + 1) +1)’ ( ) 


which can be rewritten as 


si 1 
= | dr———— =I. 13.55 
[ [ "Vrir +) , ee) 


Along path 4, we want to integrate around the branch point with a tiny circular 
contour whose radius will shrink to zero. If the radius is e, along the contour z = ee’®, 

dz= iee!°d@ and, because we are going clockwise this time, @ ranges from 277 to 0. 

This integral becomes 


f- “ta Sega D +1) 


(13.56) 


So the integration around the branch point vanishes. Remember this will not always 
be the case. There is no rule saying the integral around a branch point must vanish. 
Combining the results of these four integrals with Equation 13.50 gives 


f aie [+ [= (13.57) 
Ce 1 2 


Therefore, our final result is 


(13.58) 


fea: 
0 JVx(x +1) 


13.2. THE METHOD OF STEEPEST DESCENT 


The method of steepest descent, also known as saddle point integration, is a technique 
for approximating line integrals of the form 


I= [at (13.59) 
c 


where C is a path in the complex z-plane, as shown in Figure 13.40. The method 
relies on analytic contour deformation to manipulate C so that only a small section 
on the contour actually contributes anything significant to the value of the integral. 
This deformation might be as shown in Figure 13.41. The modified contour C’ passes 
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iy 2-plane 


Figure 13.40 Initial Contour 


through a point Z,» where | f(z)| has a steep maximum which dwarfs the values of f (2) 
at the other points on the contour. The path C’ is called the path of steepest descent 
if no other contour can be drawn which has | f(z)| decreasing more rapidly as one 
moves away from the maximum. Along this path, the integral can be approximated 
by the value obtained by integrating along C’ for only a short distance near z,, 


z,+€ 
1~ / dz f(), (13.60) 
z7€ 


as indicated in Figure 13.42. 


ce 


Distance Along Ce 


Figure 13.41 Steepest-Descent Contour 
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Figure 13.42 The Steepest-Descent Approximation 


13.2.1 Finding the Local Maximum 


To emphasize the steepness of the integrand and facilitate finding the local maximum, 
the integrand fi (z) is placed in the form 


f@ = sQe™, (13.61) 


where g(z) is supposed to be a relatively slowly varying function of z, while the 
exponential term e%® changes magnitude rapidly. Given any f(z), the selection of 
g(z) and w(z) is not unique. You can always choose 


a = 1 (13.62) 
and 
w(z) = In{f(z)]. (13.63) 


In some cases, however, other choices are more appropriate. 
Because w(z) is a complex function, we can write it as the sum of its real and 
imaginary parts: 


wz) = u(x, y) + iv(x, y), (13.64) 
so that 
lf@l = Ie@le*. (13.65) 


Because g(z) is varying slowly, the magnitude of f(z) is controlled predominately 
by u(x, y). Thus, a maximum in the magnitude of f(z) will occur at the maximum 
of u(x, y), but will be much steeper due to the effect of the exponential function. 
Because u(x, y) is a function of two variables, its extreme points occur when both 


—=0 (13.66) 
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Figure 13.43 Local Maximum 


and 
— = 0. (13.67) 


If the values of x and y that allow the simultaneous solution of Equations 13.66 and 
13.67 are x, and yo, the location in the complex z-plane of an extremum of u(x, y) is 
given by Z, = Xo + tyo. 

To determine the nature of a particular extreme point z, = x, + iyo, the second 
derivatives of u(x, y) must be investigated. If we define un. = u/dax*, Uy = 
0?u/dy", and uxy = d?u/dxdy, there are four general possibilities (see for example, 
Thomas and Finney, Calculus and Analytical Geometry): 


Ux, < O and uz.uyy — ury >0 is a maximum 


% 


Ux, > O and uxxUyy — Ury >0 Ze is a minimum 


UxxUyy ~ Uy <0 z, is a saddle point 
UxxUyy — Ury =0 cannot be determined. (13.68) 


You are no doubt familiar with the first two cases, which are simply local maxima 
and minima, as shown in Figures 13.43 and 13.44. The third case might be something 
new to you. For functions of two variables, it is possible to have an extremum which 
looks like a maximum in one direction, but.a minimum from another. This type of 


Figure 13.44 Local Minimum 
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Figure 13.45 Extremum for Analytic Functions 


extremum, is called a saddle point and is shown in Figure 13.45. The fourth condition 
listed in Equation 13.68 says that if the extreme point falls into this category, it is 
impossible to determine the nature of the extremum simply by looking at second 
derivatives. Higher-order terms must be considered. 

Usually in contour integral problems, we are dealing with functions which are 
analytic everywhere, except at isolated points. If we impose the condition that w(z) 
be an analytic function, and consequently obeys the Cauchy-Riemann conditions, an 
interesting requirement is imposed on the extrema of u(x, y). The Cauchy-Riemann 
conditions specify that 


ov 
—_— = — 13.6 
Ox oy ( ?) 
and 
Mee (13.70) 
oy Ox 
This means that 
au o*u 
— 5 7 
at oe 0 (13.71) 


Therefore, if ux, > 0, then uy, < 0, and vice-versa. Looking back at the conditions of 
Equation 13.68, this means all the extrema for an analytic function fall into the third 
or fourth categories. Those in the third category are all saddle points. The ones in the 
fourth category occur when u,, and uy, are both zero. These also turn out to be saddle 
points of a more complicated type. We defer the discussion of these cases until the 
end of the chapter. When applying the method of steepest descent to a saddle point, 
the deformed contour needs to cross the saddle such that the contour goes through a 
maxima at z,, not a minima. 
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We already determined the extrema of w(z) are located where x and y satisfy 
Equations 13.66 and 13.67. If we are dealing with analytic functions, these equations 
also imply that dv/dx and dv/dy will also be zero at these points. Therefore, a simpler 
way to determine the location of the saddle points of analytic functions is by using 
the expression 


=0 at the saddle points. (13.72) 


13.2.2 Determining the Path of Steepest Descent 


The path of steepest descent is the path through the saddle point that makes the peak of 
the curve, plotted in Figure 13.41, as narrow as possible. One simple way to visualize 
this is to think of the area around the saddle point as a mountain pass, and the path of 
steepest descent the shortest way through the pass. Alternatively, it is the path a skier 
would take to descend from the saddle point the fastest. Notice, the course the path 
takes is dependent on the details of the topology. It does not necessarily follow lines 
of constant y or constant x, and in general is not even a straight line. 

A useful way to picture the path of steepest descent is on a two-dimensional contour 
plot, such as the one shown in Figure 13.46. Lines of constant u(x, y) are plotted, with 
the solid lines used for u(x, y) > u(Xo, Yo) and dashed lines for u(x, y) < U(X», Yo). 
The saddle point (SP) is located at (x, yo). The path of steepest descent passing 
through the saddle point has also been plotted. Notice how the path of steepest decent 
always intersects the dashed lines of constant u(x, y) at right angles. 


13.2.3 Evaluation of the Integral 


Now we will attempt to approximate the contour integral along this path by only 
considering the contribution of a short section of the path around the saddle point. 


u(x,y) > u(x, y,) 


~-- u(x,y) <ux,y,) 


Figure 13.46 Lines of Constant u(x, y) 


548 ADVANCED TOPICS IN COMPLEX ANALYSIS 


y 
\ / / Be aie 
St o- j A =e 
wee eo “ a 
ea sf ae ity 
a SA ee wee / / a 
a : Ohta ee 


Figure 13.47 Contour for the Steepest-Descent Approximation 


Near the saddle point, the path can be approximated by a straight-line segment, 
tangent to the actual path of steepest descent at the saddle point. This is shown as the 
path C” in Figure 13.47. So the original integral, Equation 13.59 evaluated on the 
contour C, is exactly equivalent to the integral along C’ of Figure 13.41, as long as 
the integrand is analytic between C and C’. This value is approximately equal to the 
integral along C” of Figure 13.47: 


= [at@ i. [4zt@ a [aso (13.73) 


It is useful to modify the form of the integrand of Equation 13.61 slightly, by adding 
a constant factor s > 0 in front of the function w(z). This makes the integration we 
want to perform a function of s: 


I(s) = | dzg(zje™. (13.74) 
c# 


Essentially, the value of s is a measure of the validity of the approximation. As s 

increases, the steepness of the integrand’s peak also increases, making the integral 

along C” a better approximation to the integration along the original contour C. 
Along the short contour C”, the value of z can be written 


oe ae Se, (13.75) 


where z, is the saddle point position, 6 is a small displacement, and a, is a constant 
angle. If the length of C” is 2e, as shown in Figure 13.48, then 5 ranges from —e 
to +€ as z moves between the end points of C ". The value of a, is the angle, with 
respect to the x-axis, that makes C” tangent to the path of steepest descent. We will 
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Figure 13.48 Definition of the Angle a, 


present a method for determining this angle shortly. We can now change the variable 
of integration of Equation 13.74 from z to 6 by substituting Equation 13.75 and using 


dz = ed. (13.76) 
The approximation in Equation 13.74 becomes 


+€ ; 
I(s) = e’% ‘A d5 a(z,, + Bel ep =F), (13.77) 


€ 


Since we assumed g(z) to be a slowly varying function, it can be approximated by 
a constant value g(z, a3 for the C” integration and therefore can be pulled outside of 
the integral. When a Taylor expansion for w(z) is made about Z,5> we obtain the series 


dw 1d’w 
We) = wz) + Fo) Cm tpt aaa| & oe ee (13.78) 
~ Esp ~ dsp 
which, written in terms of 5, becomes 
Se!) = qe Sel)7 + --- 13.79 
w(Z,, + de) = w(Z,,) Ide ( ) ‘ (13.79) 


2sp 


Notice because dw/dz| z,, = 0, the lowest-order 5-dependence is 57. If we approxi- 
mate the function w(8) by taking only terms up to the lowest order in 6, and combine 
this with the constant approximation for g(z), we obtain 


62 e!2a0 


sp : (13.80) 


Be 
I(s) = gape? i dde 


TE 
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This result assumes that the second derivative, which appears in the exponent of the 
integral, is not zero. Later in the chapter, we will briefly discuss a situation where 
this is not the case. 

The only task which remains is to determine the angle a,. The path of steepest 
descent follows the line which is perpendicular to the line of constant u(x, y) that 
passes through the saddle point. From our previous work with analytic functions of 
complex variables, we know that this path is also along a line of constant u(x, y), 
because the lines of constant v(x, y) are always perpendicular to lines of constant 
u(x, y). This implies that, along the path of steepest descent, the imaginary part of 
w(z) is a constant equal to its value at the saddle point. Consequently, on the path of 
steepest descent, where we have approximated w(z) by 


1 d’w 
WG) + >> (dei), (13.81) 


2sp 


the imaginary part of w(z) is entirely contained in the w(z,,) term and 


(Se!) (13.82) 


must be a pure real quantity. Since 6 is a pure real quantity, that means that 


ett (13.83) 


must be a pure real quantity. In general, d?w/dz* lz, , is acomplex quantity, which we 
write using the exponential representation as 


d*w ; 
qe = Wire'®. (13.84) 
2 le 
In order for Equation 13.83 to be pure real, 
0, + 2a, = tna n= 0,1,2,.... (13.85) 


If nis an odd integer, Equation 13.80 becomes 


I(s) ~ g(z,,)e'% en) / de 2W0% (13.86) 
~€é 
while if n is zero or an even integer, 


© 
I(s) ~ (zee) / db eti¥ee (13.87) 


=€ 
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Because we required s to be a positive real quantity, the integrand of Equation 13.86 
drops off as 5 moves from zero, that is, as z moves away from the saddle point. 
Therefore, this equation is an approximatation for the integration along the path of 
steepest descent. In contrast, the integrand of Equation 13.87 grows as 5 moves from 
zero, and approximates the integration along a path of steepest ascent. This is not the 
path we want. With n = 1, a, must be given by 


—-0, 47 
2 


The other odd values of n in Equation 13.85 will not generate any new values for 
Qo. The ambiguity of the + sign must be determined by the direction taken by the 
undeformed contour C and the topology of u(x, y), which controls the way C is 
deformed to C’. The deformation must be such that C’ stays in the “lowland,” except 
when it crosses the saddle point. 

Equation 13.86 is almost the final result. The effect of the s parameter is now 
clear. The integrand of this expression is a Gaussian whose width is controlled 
by s. The larger s becomes, the narrower the Gaussian spike becomes, making the 
approximation more valid. This effect allows one last simplification of the result. 
Tf the Gaussian is narrow enough, the contributions from extending the limits of 
integration to minus and plus infinity will be negligible. With this approximation, the 
integration in Equation 13.86 can be evaluated to give the result 


, 2a 
I(s) ~ [eg emenre| { | wir (13.89) 


After all these approximations, you might wonder if the result could possibly 
bear any resemblance to the actual value of the original integral. If s is large enough 
and the deformation to a path of steepest descent can be accomplished, the approx- 
imations turn out to produce, in many cases, amazingly accurate results. This will 
be demonstrated in the examples that follow and in the exercises at the end of this 
chapter. 


a = 


(13.88) 


Example 13.9 One frequent application of the method of steepest descent is the 
evaluation of Laplace transforms and their inverses. For example, the transform 


F(s) = | at fe ™ (13.90) 
0 


can be approximated using the method of steepest descent for positive real values 
of s. The first step is to place the integral in steepest descent form, i-e., 


dzg(zje™. (13.91) 
iC 


To accomplish this, convert the transformation to an integral in the complex plane 
by letting ¢ become z. The initial integration is then changed into a contour integral 


552 ADVANCED TOPICS IN COMPLEX ANALYSIS 


xX 


Figure 13.49 Contour for Laplace Transforms 


along the positive real z axis, 
Ee) = [ defwe®, (13.92) 
c 


as shown in Figure 13.49. 
Now we need to identify the two functions g(z) and w(z). There are several 
possibilities. Probably the simplest approach is to set g(z) = 1 and 


fe! = eX (13.93) 


so that 


w(z) = ati@) (13.94) 


A more general approach would allow f(z) to be broken up into two parts, a slow- 
varying part 9(z) and a fast-varying part h(z). Then we would set 


I = g@r2), (13.95) 
with 


e Inf{A(z)] — sz 
err eee 


w(Z) (13.96) 


Sticking with our original choice of Equation 13.94, the Laplace transform integral 
becomes 


F(s) = [ dzet 3, (13.97) 
Cc 


The next step is to find the saddle point. This is done be setting dw/dz = 0 and 
solving for z. In this case, the saddle point is determined from the condition 
df(Z) _ 

z 


TE = sf@. (13.98) 
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To complete the approximation, we need to determine the second derivative of w(z) 
at the saddle point, and identify the values of W,” and a, to use in Equation 13.89. 
These steps are straightforward, but the details are of course determined by the nature 
of the function f(z). 


Example 13.10 As a more specific example of the use of the method of steepest 
descent, we will work out an approximate expression for the '-function, which is 
defined by the integral expression 


I(s + 1) = i} dp pre °. (13.99) 
0 


For positive integer values of s, this turns out to be intimately connected with the 
factorial function: 


T(s+ 1) =s!. (13.100) 


The I’-function can be used to define factorials of negative and noninteger numbers. 
To apply the method of steepest descent, we extend the integral into the complex 
plane by replacing p with z, 


Tis + 1) = pater. (13.101) 
Cc 


where again C is along the positive real axis, the same contour shown in Figure 13.49. 
The integral for the '-function must now be put into steepest descent form, so that 
w(z) and g(z) can be identified. Remember the desired form is 


Tis + 1) = 7 dz g(zje™™, (13.102) 
Pais 


Neither z° nor e 2 appear to be slowly varying functions and so we will set g(z) = 1. 
This is always a safe choice. That means that w(z) in Equation 13.102 becomes 


w(z) = In(z) — > (13.103) 


Taking the derivative of this function with respect to z and setting it equal to zero 
gives the location of the saddle point: 


Z,, — 5. (13.104) 


Because s is supposed to be a positive real quantity, the saddle point is on the original 
contour, and no deformation is necessary to make the contour cross it. 

It is pretty clear that a, should be zero, but let’s see how this falls out of the 
method. To find a, we first evaluate the second derivative of w(z) at the saddle point. 
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This gives 
os (13.105) 
dz a ies eer ; 
This means that 
Wi= d 13.106 
o- s2 ( . ) 
and 
0 = T. (13.107) 


With this value for 6,, Equation 13.88 gives two values for a9, @ = Oanda, = —7. 
If we were to use the value a, = —7, the contour would pass through the saddle 
point from right to left. Since the original integration was from left to right, the a = 0 
solution should be selected. 

With these values for z,,, W,", %, and the w(z) and g(z) functions, Equation 13.89 
gives the result: 


T(s + 1) = V2stse“. (13.108) 


This is the simplest form of Stirling’s approximation. 


13.2.4 Higher-Order Saddles 


Back in Equation 13.80, where we performed a Taylor series expansion of the function 
w(z) around the saddle point, we assumed the first nonzero term involved the second 
derivative. The result of Equation 13.89 and the validity of all of our examples relied 


Figure 13.50 A Three-Legged Saddle 
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on this fact. An interesting situation occurs if the second derivative, evaluated at 
the saddle point, turns out to be zero. Then we must go to higher-order terms in 
the Taylor series to get the lowest 5 dependent term needed for Equation 13.80. 
Interesting saddle structures occur in these cases. They are saddles for riders with 
more than two legs! An example of the saddle formed when the first nonzero term 
is the third derivative is shown in Figure 13.50. Notice how a single, straight-line 
contour cannot approximate the line of steepest descent near the saddle point in this 
case. 


EXERCISES FOR CHAPTER 13 


1. Consider the complex function 
w= (2-0, 


(a) How many z-plane Riemann sheets are required to describe this function? 
(b) Locate the branch points and branch cuts in the z-plane and show how the 
z-plane Riemann sheets are connected. 


(c) How many w-plane Riemann sheets are required to describe this function? 
(a) Locate the branch points and branch cuts in the w-plane and show how the 
w-plane Riemann sheets are connected. 


2. The function 
wee =a 


is represented on two sheets in the z-plane and two sheets in the w-plane. Place 
the cut on the real axis of the z-plane and indicate where each sheet of the z-plane 
maps onto the two w-plane sheets. 


3. Consider the complex function 
w= (z- 11?@+ 13, 


(a) Locate the branch points and branch cuts of this function in the z-plane. 
(b) How many Riemann sheets are required for the z-plane? 

(c) How are these Riemann sheets connected? 

(d) How many Riemann sheets are required for the w-plane? 


4. Consider the complex function 


(=) 
w=In{ = : 
z- 1 


(a) Identify the branch points and locate the branch cut(s) in the z-plane for this 
function. 
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(b) How many z-plane Riemann sheets are required to represent this function? 
(c) How are the Riemann sheets connected? 
(d) Draw several closed contours, which span multiple sheets in the z-plane. 


. Locate the branch points and branch cuts in the z-plane and describe the connec- 


tion of the Riemann sheets for the following functions: 
i. w = In(? ~ 1). 


For each of the following functions: (1) Identify the number of Riemann sheets 
in the complex z-plane and the complex w-plane. (2) Locate all the branch points 
and position the branch cuts. (3) Show where the z-plane maps onto the w-plane 
and vice versa. (4) Indicate how the Riemann sheets are connected: 


i. w= 7), 
ii. w= 7 
iii, w = cosz 


. The function (z — 1)!/* has branch points at z = 1 and at infinity. For this 


problem, take the cut to be along the positive real axis and evaluate the integral 


[exe ai 1)'?, 
C 


The contour C is the circular path of radius 2, centered at z = 0, that starts 
just above the cut and ends just below the cut on the first Riemann sheet of the 
z-plane, as shown below: 


Sheet 1 


To evaluate this integral show that the contour can be deformed to the one shown 
below: 
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Sheet 1 


Repeat the evaluation of the integral if C is assumed to be on the second Riemann 
sheet of the z-plane. 


8. Consider the real integral 


[= i, dx ~ ; 
0 x7 +1 
Indicate a closed contour in the complex z-plane that can be used to evaluate 
this integral. Explain your reasoning by showing the pole(s), branch points, and 


branch cut(s) of the integrand and discussing the contributions from the segments 
used to accomplish closure. You do not have to actually evaluate the integral. 


i: dx Ve 

0 (x + 1)? 

by extending the integral into the complex plane and using the contour C shown 
below. 


9, Evaluate 


y 


10. Consider the complex function 


1 


o @= DIAG +2) 
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11. 


12. 


13. 


14. 


15. 
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(a) Identify the branch cuts, branch points Riemann sheets, and singularities of 
this function in the z-plane. 


(b) Now consider the real integral 


[ dx 
o (x2 — x)/2(x + 2)" 


Extend this integral into the complex plane and identify an appropriate 
contour to evaluate it. 
(c) Evaluate the integral in part (b). 


Describe the branch cuts and branch points of the function (1 —z”)!/? and evaluate 
the real integral 


i dx 
-1 (4 -2)/1— x2 


Evaluate the real integral 
1 
dx 
— b>a>Q0. 
Us @2 — 1)'2(ax + b) - 
Consider the function w = z? + 1. 


(a) Determine the location of the saddle point for this function. 

(b) Expand w(z) in a Taylor series about the saddle point and determine a, the 
angle of a straight line passing through the saddle point tangent to the path 
of steepest descent. 

(c) Repeat steps (a) and (b) above for the function w = ve +1. 


Consider the function z 


(a) Show that this function satisfies the Cauchy-Riemann conditions. 
(b) Identify the location of a saddle point. 
(c) Describe the paths of steepest ascent and descent through this saddle point. 


Consider the function 


(a) Where in the complex z-plane does this function have saddle points? 

(b) Identify the lines in the z-plane along which Real [w] = 0. 

(c) The lines determined in part (b) divide the z-plane into six regions. For each 
region, indicate whether the real part of w is greater or less than zero. 
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16. 


17. 


18. 


19. 


(d) Near each of the saddle points, determine the directions of the paths of 
steepest descent and steepest ascent through these saddle points. 


(e) Using the method of steepest descent, determine the approximate value of 


the real integral 
ie dx e 8? +3) 
0 


where s is a large, positive real number. 
Let 


w = sinz. 


(a) Where are the saddle points of this function in the z-plane? 
(b) Identify the paths of steepest descent and ascent through these saddle points. 


(c) Make a plot of the lines of constant Real [w] in the complex plane and label 
several of the contours to clearly show the topology. Also draw the lines of 
steepest descent and ascent on this plot. 


Consider the function 
w = sinz — z. 
(a) Show that this function has saddle points at z = 27m where n = 0,+1, 
OD ade 
(b) Expand the function in a Taylor series about the saddle point at z = 0. What 


is the second derivative of w at this point? 

(c) What is the value of the function at z = 0? Use this value to divide the 
z-plane into regions around z = 0, where the real part of the function is 
either larger than or smaller than its real part at z = 0. 


(d) Determine all the possible paths of steepest descent and steepest ascent that 
pass through z = 0. 


Use the method of steepest descent to approximate the value of the integral 


I(x) = i dte** 
0 


for large, positive x. 


Consider the real integral 


= 2 
i dxxe™™*, 
0 


where s is a real positive number. 


(a) Evaluate this integral exactly using the fact that the integrand is an exact 
differential. 
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21. 


22. 
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(b) Now evaluate the integral by the method of steepest descent and compare 
this approximate answer to the exact expression obtained in part (a). 


. Use the method of steepest descent to approximate the Laplace transform of 


FOF NG eo 


and compare it to the exact transform. Now repeat the same process for the 
transform of 
_ jr t>0 
iO . 1<0" 


where n is any positive integer. 


Consider the integral 


1 ortie — exts-s'”) 
I@) = — i ds<——_., 
2m Jy 


ies Ss 


where the path of integration is along a vertical line to the right of the origin in 
the s-plane. 

(a) Locate the branch points of the integrand in the s-plane. 

(b) Place the branch cut so that the path of integration will not cross it. 

(c) Evaluate (x) for negative x. 

(d) For large positive x, use the method of steepest descent to approximate /(x). 


According to transform tables, the inverse Laplace transform of 


de fe 
spe a/s 


is O for ¢ < 0, and equals 


sin (2\/at) 


1 
Jf 7a 


for t > 0. Use the method of steepest descent to approximate 


eS @ 

Qari = fas" aR 2 
Contour plots for the constant real part of w(s) help to see how to deform the 
path of integration. When you deform the Laplace contour £, remember to take 


into account the branch cut, branch points, and the singularity of the integrand. 
The integrand has a slowly varying function of s, and both saddle points of 
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w(s) should be used. How does your approximate answer compare to the correct 
answer for large values of t? What would have happened if you had used only 
one saddle point? If possible, use a computer to make a 3D plot of the real part 
of w(s). 


23. Use the method of steepest descent to find an approximation for the inverse 
Laplace transform of 


1 
FD = aR 


Show the paths of steepest ascent and descent through the saddle point(s) and 
discuss the deformation of the path of integration. Also, include comments on 
any branch points and branch cuts in your solution. Compare your answer to the 
exact value of the inversion, which is 


t<0 


0 
10 = 0) a oe 


Now repeat this process for the inversion of 


1 
PO) = RH) 


14 


TENSORS IN NON-ORTHOGONAL 
COORDINATE SYSTEMS 


This chapter continues our discussion of tensors. It begins with a brief review of tensor 
transformation properties, which was covered in more detail in Chapter 4. Then we 
tackle the complications that arise in non-orthonormal coordinate systems. The ideas 
of contravariance, covariance, and the metric tensor are developed to handle inner 
products in non-orthonormal systems. 


14.1 A BRIEF REVIEW OF TENSOR TRANSFORMATIONS 


In Chapter 4, we discussed how a tensor is defined by its behavior under coordinate 
transformations. With only a hint of sarcasm, the definition was given by the state- 
ment, “A tensor is any quantity that transforms as a tensor.’ What this means is that 
transformation rules are enough to give tensors all of their special properties. If an 
object transforms between coordinate systems using tensor transformation rules, you 
can legitimately say the object is a tensor. 

Remember, the elements of a tensor can be transformed using a transformation 
matrix whose elements can be obtained from the equations relating the coordinates of 
the two systems under consideration. For transformations between Cartesian systems, 
the elements of this transformation matrix [a] are given by 


a;; = (@-8;) = —. (14.1) 


In this equation, the original system has coordinates x; and basis vectors é;. The system 
being transformed to, the primed system, has coordinates x} and basis vectors é}. For 
orthonormal coordinate systems, the inverse of this transformation matrix is always 
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given by its transpose 


a;;' = aji- (14.2) 


An arbitrary tensor of rank n can be expressed in either the unprimed or primed 
coordinate system as 


T = Tip. G8&... = Ti, ¢éle!..., (14.3) 


where there are n subscripts and n basis vectors in each term. T;,,. and T/,, are the 
elements of the tensor in the unprimed and primed coordinate systems, respectively. 
The two sets of elements are related to each other via the transformation matrix 
equation 

jee = T jk... @ris j rk sein (14.4) 
where the matrix [a] appears n times. The inverse transformation is 


To Tj... Fir jsBks band (14.5) 


Our fundamental assertion is that any quantity that transforms in the manner described 
by Equation 14.4 is by definition a tensor. 

Because a vector is a first rank tensor, vector component transformations are also 
described by Equations 14.4 and 14.5. If we write the vector in the two coordinate 
systems as 


V = Vie; = Vie, (14.6) 
the relation between the components is given by 
V! = Vian (14.7) 
and the inverse 
V, = Viair. (14.8) 


Scalars are invariant under a coordinate transformation. We can think of a scalar 
as a zero rank tensor. The single element of a zero rank tensor has no subscripts and 
is not combined with any basis vectors. Equation 14.4 reduces to 


S= S' (14.9) 


where S (or S’) is the single element of the scalar. 


Example 14.1. As an example of using transform properties to identify an object as 
a tensor, consider the Kronecker-6. Recall that this symbol was introduced to form dot 
products in orthonormal coordinate systems. The dot product between two vectors, 
A and B, written in two different orthonormal coordinate systems, one unprimed and 
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one primed, can be written 
A-B = A;Bj8;; = A’B/5).. (14.10) 


Now we already know that 6,; and 6/, can both be expressed as unit matrices, that 
is as [1]. For the purposes of this discussion, however, let’s look at the consequences 
of just requiring that both expressions for the inner product in Equation 14.10 be 
equal, and that A; and B; are vector components, and therefore transform according 
to Equation 14.7. Substituting into Equation 14.10 for A! and B! gives 


A,B 63; =, 4, Ajas;B;5,,. 
= A;B jaria5;8),. (14.11) 
Because this expression must be true for any A and B, we can write 


8 = ris ;,,. (14.12) 


Inverting this expression gives 
5; = Gir js5ys. (14.13) 


Comparing Equations 14.12 and 14.13 with Equations 14.4 and 14.5, it can be seen 
that the elements of the Kronecker-6 transform like the elements of a second-rank 
tensor. Therefore, the Kronecker-5 symbol is a second-rank tensor, which can be 
expressed with basis vectors as 


5 = 6,€:e; = 5/,8/8'. (14.14) 


14.2 NON-ORTHONORMAL COORDINATE SYSTEMS 


Up to this point, we have dealt only with orthonormal coordinate systems. In Cartesian 
systems, the basis vectors 6; are independent of position and orthonormal, so é; - 
é; = 6,;. In curvilinear systems, the basis vectors q; are no longer independent of 
position, but they are still orthonormal, so q; - @; = 5;;. Now we will consider non- 
orthonormal systems. To distinguish these systems, we will label the basis vectors 
of non-orthonormal coordinate systems as g,;, and the non-orthonormality condition 
becomes §; - &; # 5;;. To keep the discussion and derivations as simple as possible in 
this chapter, we will limit ourselves to coordinate systems which have basis vectors 
that do not vary with position. This is obviously not the most general type of non- 
orthonormal coordinate system, but it is enough to demonstrate the important ideas 
of contravariance, covariance, and the metric tensor. In Appendix F, we present some 
of the generalizations required to describe non-orthonormal systems with position- 
dependent basis vectors. 

In physics, non-orthonormal coordinate systems appear, for example, in both 
special and general relativity. The basic postulate of special relativity is that the 
speed of light c, is the same in all inertial reference frames. As a consequence of 
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Figure 14.1 The Coordinate Systems of Special Relativity 


this postulate, both the position and time coordinates of some physical phenomenon 
(an “event”) change as you change reference frames. This is very similar to how the 
components of a vector change as you transform the coordinate axes used to describe 
them. If we restrain the motion to be along a single spatial direction, an event can be 
described by two coordinates, a single position coordinate and time. As will be shown, 
the observation of an event in two different coordinate systems, one primed and one 
unprimed, can be plotted as a single point using the combined set of axes as shown in 
Figure 14.1. By taking its components with respect to both coordinate axes, we can 
get all the relationships imposed by special relativity. Notice how the x and ct axes 
intersect at right angles, but the x’ and ct! axes do not. While the unprimed system 
appears to be orthogonal, the primed system looks like a non-orthogonal, skewed 
system. We will discuss this picture, and its interpretation for special relativity, in 
some detail in the sections that follow. 

The basic postulate of general relativity is that gravity and acceleration are equiv- 
alent. Events observed in a gravitational field appear as if they were being observed 
in an accelerating coordinate system. This implies that light propagating through the 
gravitational field of a massive object, such as a star, should be bent, as depicted in 
Figure 14.2. This would cause the apparent position of a star to deviate from its actual 


Figure 14.2. A Coordinate System of General Relativity 
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location. This phenomenon was first observed by Arthur Eddington, who measured 
the small deflection of stars by the sun during the total eclipse of 1919. The paths that 
light rays follow through space are called geodesics. A natural choice for the grid lines 
of a localized, coordinate system follows these geodesics, as shown in Figure 14.2. 
The basis vectors in such a coordinate system are both non-orthonormal and spatially 
dependent. We will not discuss examples of this type of system, but instead restrict 
our comments to “skewed” systems, where the basis vectors are not orthonormal, 
but are spatially invariant. The interested reader can investigate the very heavy book 
Gravitation by Misner, Thorne, and Wheeler for an introduction to general relativity. 


14.2.1 A Skewed Coordinate System 


Consider the two-dimensional, Cartesian system (x), x2) and the non-orthonormal, 
primed system (x{,x}) shown in Figure 14.3. Two pairs of basis vectors and an 
arbitrary vector V are also represented in the figure. The basis vectors of the unprimed 
system are orthonormal: 


é; : é; = 6;;. (14.15) 


8-8) = &-& = 1, (14.16) 


8! - a! # 8). (14.17) 


Figure 14.3. An Orthonormal and a Skewed Coordinate System 


NON-ORTHONORMAL COORDINATE SYSTEMS 567 


As you will see, the formalism that is developed here does allow for systems with 
basis vectors that are not unit vectors. However, to start out as simply as possible, we 
assume the basis vectors of the skewed system satisfy Equation 14.16. 

In the orthonormal system, the vector V can be expressed as the sum of its axis 
parallel projected components, shown in Figure 14.3, coupled with the corresponding 
basis vectors: 


V = V; é, + V2 €. (14.18) 


These vector components are just the projected lengths of V along the axes of the 
unprimed system and can be determined with simple trigonometry or by following 
straightforward vector manipulation. A particular component is obtained by the dot 
product between the vector V and the corresponding basis vector. For example, to 
find V;: 


V-@ = (V, & + Vo @)-& 
= Vi « 1) + Vo(E - €) 
= V8 + V2821 
=Vj\. (14.19) 


This works out nicely because of the orthonormality of the basis vectors. 
In the primed system, the same vector can be written in terms of its axis-parallel 
projected components and the primed basis vectors, as shown in Figure 14.3, 


V=Vi ai + VSB. (14.20) 


These primed vector components can also be determined from trigonometry, but 
because of the skewed geometry, this is no longer as simple as it was in the orthogonal 
system. Because the primed basis vectors are not orthogonal, an attempt to isolate 
a particular primed component by a vector manipulation similar to Equation 14.19 
fails: 


Mt 


V-& = (Vi, 8 + V2 &)- Bi 

Via - &) + V2 - 81) 

= Vi + Vas ° 81) 

#Vi. (14.21) 


i 


It appears that vector manipulations in non-orthonormal coordinate systems are 
much more difficult than in orthonormal systems. Fortunately, there are some formal 
techniques which simplify this process. In the next section, we introduce the concepts 
of covariance, contravariance, and the metric tensor. Using these devices, the dot 
product between two vectors has the same form in an orthogonal system as in a 
non-orthogonal system. 
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14.2.2 Covariance, Contravariance, and the Metric 


The basic complication introduced by a non-orthonormal coordinate system is evident 
in the dot product operation. In the two-dimensional orthonormal system described 
above, the inner product between two vectors is given by 


A-B = Aje; - Bye; 
bas AjB j6;; 
= A,B, + A2B>. (14.22) 


If this same inner product is performed in the non-orthogonal system of Figure 14.3, 
the result contains some extra terms: 


A-B= A; 8;- Bj 8; 
= AjBi(&; - 8) 
= A{ Bi + A}By + (AjB3 + A5Bi)(B; * 82). (14.23) 


The inner product formed in the non-orthonormal system, as expressed in Equa- 
tion 14.23, can be placed in the from of Equation 14.22 by rearranging as follows: 


A: B= Aj[By + Bs(8 - &5)) + AS[Bi (6) - 8) + By). (14.24) 
Now define a new set of components for B as 
Bi = By + By - &) 
By = By (8; - 8) + B. (14.25) 


These quantities are called the covariant components of B, while the original compo- 
nents are called contravariant. Clearly, the vector B cannot be expressed by combin- 
ing these new covariant components with the basis vectors §; and }: 


B + Big) + Bg). (14.26) 
However, with these components, the inner product evaluated in the skewed system 
can be placed in the simple form 
A+B = A‘B! 
= AlBi + AjB3. (14.27) 
Notice that the inner product could also have been written as 
A- B= AjB;, (14.28) 
with the covariant components of A defined as 
Aj = Aj + Axi * 8) 
AS = Aji(8} - &) + A}. (14.29) 
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The inner product needs to be formed with mixed contravariant and covariant com- 
ponents, but it does not matter which vector is expressed with which type. 

These arguments can be extended to non-orthogonal systems of arbitrary dimen- 
sions. The restriction that the basis vectors be normalized to unity can also be dropped. 
The covariant components can be generated from the contravariant components using 
the general expression 


Aj = Ai (®} - }). (14.30) 


We have used the subscript summation convention here to imply a sum over the index 
j. For an n-dimensional coordinate system, there will be n terms in each sum. Notice 
that if the coordinate system is orthonormal, Equation 14.30 reduces to A/ = A/, and 
the covariant and contravariant components are equal. In this case, both Equations 
14.27 and 14.28 will revert back to Equation 14.22. This is important, because it 
implies this new notation is general enough to handle all of our previous Cartesian 
and curvilinear coordinate systems, as well as the new non-orthonormal ones. 

There is another way of expressing the inner product between two vectors in a 
non-orthonormal system that makes use of a quantity called the metric. As will be 
shown later, the metric turns out to be a second-rank tensor. The elements of the 
metric, in an unprimed system, are defined as 


Mi; = 8; * 8). (14.31) 
Notice this definition implies that the metric is symmetric: 
Mi; = Mj. (14.32) 
Using the metric, Equation 14.30 can be written as 
A; = AjMijj. (14.33) 


The metric converts the contravariant components to covariant components. 
Now the inner product between A and B can be written as 


A % B ar. A;BjMij. (14.34) 


A sum over both i and j is indicated on the RHS of this equation. If the sum over i is 
performed first, Equation 14.34 becomes 


When the sum over j is performed first, Equation 14.34 instead becomes 
A-B = A,B;. (14.36) 


When Equation 14.34 is used for the inner product, the vector components are not 
mixed. Contravariant components are used for both. If the system is orthonormal, 
M,; = 6), and the standard inner product for orthonormal systems results. Notice 
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that the metric is determined solely by the basis vectors of the coordinate system. 
This turns out to be an important fact, and will allow us to identify the metric as a 
second-rank tensor. 

In summary, there are two ways to perform the inner product between two vectors 
in a non-orthonormal system. One way is to use the covariant and contravariant 
components, as was done in Equations 14.27 and 14.28. A completely equivalent 
method is to use the metric and the regular contravariant vector components, as 
demonstrated in Equation 14.34. These arguments can be naturally extended to inner 
products between tensor quantities, but this generalization will be postponed until 
the transformation equations for non-orthonormal systems are worked out. 


14.2.3 Contravariant Vector Component Transformations 


Imagine two different skewed coordinate systems, as shown in Figure 14.4. We 
want to find out how the contravariant components of a vector expressed in the first 
system can be transformed into the second system. The first system has the unprimed 
coordinates x; and the basis vectors €;, while the second system uses the primed 
coordinates x/ and basis vectors @/. Remember, we are still limiting ourselves to 
coordinate systems with constant basis vectors. Let the genera] equations which 
relate the two sets of coordinates to one another be 


Xf = x}{(x1,X2, X3) 
Xi = Xi(X], x4, X4). (14.37) 
There will be one pair of equations for each dimension of the systems. 


In our previous work dealing with transformations between orthonormal coordi- 
nate systems, we were able to relate vector components of one system to another via 


Figure 14.4 Two Skewed Coordinate Systems 
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[a], the transformation matrix: 
Vi= aj;Vj. (14.38) 


The restriction to orthonormal systems allowed us to invert this expression easily, 
because it turned out that a;;' = aj. We can still write a relationship similar to 
Equation 14.38 for transformations between non-orthonormal systems, but we need to 
be more careful, because the inverse of the transformation matrix is no longer simply 
its transpose. To keep tabs on which transformations have this simple inversion and 
which do not, we will reserve the matrix [a] for transformations between orthonormal 
systems. The matrix [t} will represent transformations from unprimed coordinates to 
primed coordinates, where the systems may be non-orthonormal: 


Vv; =. tijVj. (14.39) 


The reverse operation, a transformation from primed coordinates to unprimed coor- 
dinates, will use the matrix [g], 


Vi = giVj, (14.40) 
where g;; = t;;' # t,. By their very definition, it follows that t;;g j, = 5. We will 
discuss the relationship between the [t] and [g] matrices in more detail later. In both 
these expressions, the vector components are the regular, contravariant components 
of V, not the covariant components we introduced earlier. 

All the vectors at a given point transform using the same [t] matrix. To determine 


the 1;; elements, it is easiest to consider the displacement vector dr, which in both 
coordinate systems is given by 


d¥ = dx; 8 = dx; &. (14.41) 
Applying this equality to Equation 14.39 gives 
dx} = hij dx;. (14.42) 


Referring to Equations 14.37 gives the relationship 


dx! = = dx, (14.43) 
e, 


i= —. 14.44 
aes (14.44) 
So far, these results look very similar to those of the Cartesian transformations of 
Chapter 4. In fact, the equation for the components of [1] given in Equation 14.44 is the 
same result obtained for the [a] matrix between Cartesian systems. The complications 
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arise when we try to invert these equations. As we mentioned before, the inversion 
of [1] is no longer simply its transpose. A general way to obtain [t]~!, what we are 
calling [g], is to use the expression 

Cii 


earptea= 


ij = Fi; 


; 14.45 
It; j Ider : ? 


where cj; is the ji-cofactor of the t;; matrix. From matrix algebra, this cofactor is 
defined as (—1)'*/ times the determinant of the #;; matrix, with the jth row and the 
ith column removed. The [g] matrix can also be obtained from the equations relating 
the coordinates in exactly the same manner that led to Equation 14.44: 


~1 Xi 


8ij ij ae (14.46) 
Jj 


Ih 


The [t] and [g] matrices can also be used to relate the basis vectors to one another. 
Using the contravariant components, any vector V can be expressed in the primed or 
unprimed systems as 
Substituting Equation 14.39 into Equation 14.47 gives 

V = V; 8; = Vj ti 8. (14.48) 

Since this expression must be true for any V, we must have 

£; = ti g}. (14.49) 
Following the same steps above, using [g] instead of [t] gives 

8) = gi; Bi. (14.50) 
Notice the contravariant vector components are transformed by contractions over 
the second subscript of either ¢;; or g;;, while the basis vectors are transformed by 
contracting over the first subscript. 


To summarize the results of this section, the transformation between the two 
non-orthonormal coordinate systems is governed by the relations 
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14.2.4 Subscript/Superscript Notation 


Before proceeding with a discussion of how covariant vector components transform, 
it turns out to be convenient to introduce some new notation. The tilde (V,) notation we 
have been using for the covariant vector components is clumsy. It is not obvious that 
the following conventions are much better, but they do provide a valuable mechanism 
for keeping track of which type of component (contravariant or covariant) should 
be used in an expression. The standard, axis-parallel projected vector components, 
which we have called the contravariant components, will now be labeled with a 
superscript, while the new covariant components will use a subscript instead of a 
tilde. For example, the contravariant components of the vector V are V‘, while the 
covariant components are V;. 

One advantage of this new notation is evident by looking at the form of the inner 
product. With the superscript/subscript convention, we can write the dot product of 
A and B as 


A-B =-A'B, = A;B'. (14.51) 


Notice the index being summed over appears once as a subscript and once as a su- 
perscript. This, of course, is the same as saying that the sum is done over mixed 
contravariant and covariant quantities. This process of mixed superscripts and sub- 
scripts will persist for almost all contractions over a repeated index. It even works 
in forming a vector from its components with the proper interpretation of the ba- 
sis vectors. We know that the vector can be formed with the contravariant vector 
components and the basis vectors: 


¥ = vig, (14.52) 


To be consistent with the subscript/superscript convention, the basis vectors must be 
labeled with subscripts and be considered covariant. We will see, in the next section, 
that this conclusion is consistent with the way these basis vectors transform. 

This convention also prevents us from accidently forming a vector by combining 
its covariant vector components with the g; basis vectors: 


V + Vig. (14.53) 


The notation warns us that this is not correct because both indices appear as subscripts. 
In the previous section we generated several relations which described how the 
contravariant components of a vector V transform between two skewed systems. How 
should the presentation of these results be modified to be consistent with the new 

superscript/subscript convention? In the previous section we had written 
V/ = tyV;. (14.54) 


Now these vector components need to be superscripted. To be consistent with this 
new notation, one of the indices of the transformation matrix needs to be a subscript 
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and one a superscript, 


vist, Vv, (14.55) 
where 
en ax!! 


In a similar manner, the inversion of Equation 14.55 becomes 


vi= giv’, (14.57) 
where 
: ax! 
eS oat (14.58) 


In Equations 14.56 and 14.58, notice how the superscripted component in the de- 
nominator of the partial derivative results in a subscripted index on the LHS of 
this expression. This is a general property of partial derivatives with respect to con- 
travariant and covariant quantities. A partial derivative with respect to a contravariant 
quantity produces a covariant result, while a partial derivative with respect to a 
covariant quantity gives a contravariant result. We will prove this fact later in the 
chapter. 

These transformation matrices have what is called mixed contravariant/covariant 
properties. They are contravariant with respect to one index, but covariant with respect 
to another. We have not really defined what these words mean for quantities with 
more than one index yet, and we will defer this to a later section of this chapter, where 
we talk about tensor transformations. 

With the earlier notation, the reciprocal nature of [¢] and [g] was indicated by 
the equation ¢;;g, = 5x. But now, to be consistent with the subscript/superscript 
convention, we should write 


git, = 3. (14.59) 


The Kronecker-5, written in this way, also has mixed contravariant and covariant 
form. 

Equations 14.49 and 14.50, which indicate how the basis vectors transform, are 
written using superscript/subscript notation as 


| Si ', 8 

8) = g'; Bi. (14.60) 
Notice how the horizontal positioning of the indices of the transformation matrix is 
important. In Equation 14.55 and 14.57, the sum was over the second index of the 


matrix, while these sums are over the first index. This prevents us from writing the 
[t] matrix elements as ¢', since this would no longer indicate which index comes first. 
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We should also rewrite the relations involving the metric using the new notation. 
Our previous definition of the metric was in terms of the covariant basis vectors. 
Consequently, Equation 14.31 remains unchanged: 


and both indices remain as subscripts. Formed this way, the metric elements are purely 
covariant because both indices are subscripts. The metric converts the contravariant 
components of a vector to its covariant components, within the same coordinate 
system. This operation can be written, using the superscript/subscript notation, as 


V; = MijV!. (14.62) 


Notice how the summation convention continues to work. This same operation in a 
primed system uses a primed metric, 


Mj; = 8 8, (14.63) 
and is written as 
Vi = MiV". (14.64) 


In summary, the equations governing the transformations of the contravariant 
components of a vector V can be written using the new superscript/subscript notation 
as: 


t, = dx!!/ax! gi, = ox! fax! 
ee Leys Re dea 
yl = rjV? y' = giyv? 


A 


gj = 1,8! 8) = 3! 


The covariant components of V can be obtained from the contravariant components 
using the metric: 


There are clearly some holes in this picture. First, there is the question of how 
the covariant components of a vector transform. Second, we said the basis vectors §; 
were covariant in nature. We need to prove this. Finally, can we define contravariant 
basis vectors? These issues are all intimately related to one another, and are addressed 
in the next section. 
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14.2.5 Covariant Vector Component Transformations 


Return to the pair of skewed coordinate systems described in Figure 14.4. The 
covariant vector components of the vector V will transform according to some linear 
relation 


V! = [2]V;. (14.65) 


To determine [?] in this expression, consider two equivalent forms of the inner product 
of two vectors, one in the primed system and one in the unprimed system: 


A-B = A' B; = A! Bi. (14.66) 
The contravariant components of A transform according to the rules determined 
earlier: 
AU = thai, (14.67) 
Substituting this expression into the far RHS of Equation 14.66 gives 
A’ B; = Alt’, Bi (14.68) 
Since this equation must be true for any A, we must have 
B; = #, Bi. (14.69) 
This expression is easily inverted to give 
B! = g, B;. (14.70) 


Notice the similarity between Equations 14.60, 14.69, and 14.70. This supports our 
conclusion that the axis-parallel basis vectors are covariant. 

We were able to combine the contravariant components of a vector with the 
covariant basis vectors to form the vector itself: 


V = Vg). (14.71) 


It would be nice to come up with a new set of contravariant basis vectors, 8', which 
could be combined with the covariant vector component to form the same vector. 
That is, 


V=Vie. (14.72) 


In fact, we can use this expression to define these contravariant basis vectors, and 
look at the consequences. 

The basic properties of the contravariant basis vectors can be deduced by again 
considering the inner product between two vectors, A and B. If A is expressed using 
covariant basis vectors and contravariant components, while B is written with con- 
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travariant basis vectors and covariant vector components, the inner product becomes 
A-B = A’ 6, -B; @ 
= A' B; 6; - &. (14.73) 
According to Equation 14.51, this expression must be equal to A’ B; and so 
ee ={ oe) (14.74) 
0 
or in terms of the Kronecker symbol, 
8 - 8! = 6,/. (14.75) 


This last condition allows both the magnitude and direction of the contravariant 
basis vectors to be determined, if the covariant basis vectors are already known. 
Working in two dimensions, @! - = 0 and g! - &, = 1. In words, g! must be 
perpendicular to #2, while its projection along the 1-axis, parallel to ,, must be 
one. This uniquely determines #' and, by similar arguments, 8). The conditions of 
Equation 14.75 can be pictured graphically as shown in Figure 14.5. The constructions 
in this figure have been made assuming that |é,;| = 1. 

The covariant and contravariant vector components can also be interpreted graphi- 
cally, as shown in Figure 14.6. Again, for this figure, it has been assumed that |g;| = 1. 
The contravariant vector components are simply the magnitudes of the axis-parallel 
projections of the vector onto the skewed axes defined by the covariant basis vec- 
tors. The covariant components are the magnitudes of the projections of the vector 
onto the same coordinate axes, but following lines parallel to the new contravariant 
basis vectors. This makes the projection lines for the covariant vector components 


12 


v=) 
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Figure 14.5 Determination of the Contravariant Basis Vectors 
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Figure 14.6 Covariant and Contravariant Component Projections of a Vector 


perpendicular to the axes, as shown in this figure. The geometry insures that 
V=Vig, = Vie. (14.76) 


If the covariant basis vectors are not unit vectors, the constructions of Figures 14.5 and 
14.6 must be adjusted appropriately, following the requirements of Equations 14.75 
and 14,76. 

Transformations for the contravariant basis vectors fall directly from Equation 
14.76 by the techniques we have applied several times: 


a = 1p (14.77) 

a = 3 8”. (14.78) 

This confirms our classification of these new basis vectors as contravariant, because 
they transform exactly like the contravariant components of a vector. 

The complete set of transformation rules for both contravariant and covariant 


vector components and basis vectors can be summarized by the symmetric set of 
relations: 


with 


ees ae err 
| t'; = Ox * Lax 3, = ax! fax". 


NON-ORTHONORMAL COORDINATE SYSTEMS 579 


Notice that contravariant quantities are always transformed by a summation over the 
second index of either f'; or g' ;, while covariant quantities transform by summing 
over the first index. For contravariant quantities, t ; 18 used to go from the unprimed 
to the primed system, while g' ; is used to go from the primed to the unprimed system. 
For covariant quantities, the roles of t' j and g' j are reversed. 

The new contravariant basis vectors allow us to construct another version of the 
metric, this time with superscripts: 


M' = @ - B’. (14.79) 


Application of this form of the metric converts covariant quantities into contravariant 
quantities. For example, 


Vi = MY Vj. (14.80) 


We will see in the next section that the two different metrics, M;; and M'/, are simply 
different representations of the same object, the metric tensor. 


14.2.6 Covariance and Contravariance in Tensors 


The covariant and contravariant properties discussed above are easily extended to 
tensors. Just as a vector can be expressed with contravariant or covariant components, 


V=V'e,=V, 8, (14.81) 
a tensor can be expressed using purely contravariant or covariant components: 
T = 7T'* 6 8) & = Tix Bt 8 (14.82) 


However, higher-rank tensors are more flexible than vectors because they can also 
be expressed in a mixed form, with both contravariant and covariant indices. For 
example, 


T=Tik so & (14.83) 


is another equivalent representation of T. 

All the tensor expressions in Equations 14.82 and 14.83 are equivalent, although 
the specific values of the components will be different in each case. Just as the 
covariant and contravariant components of a vector are related via the metric, the 
different representations of T can be obtained from one another using the same 
metric. For example, if the two expressions for T in Equation 14.82 are equal, we 
can write 


TUE = M" MI" M™ Tin. (14.84) 
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The expression in Equation 14.83 gives the same tensor when 
rik = Mj, T™. (14.85) 


To convert a set of tensor components from purely covariant to purely contravariant 
form, one metric operation is needed for each index. 

Coordinate system transformations of tensors follow the same pattern we estab- 
lished for vector transformations. One transformation matrix of the appropriate kind 
is used for each index. For example, we might write 


Ti = Bi thn 8% T1"n (14.86) 
Tit gt eT a (14.87) 


Example 14.2 Reference has been made to the fact that the metric is a tensor, but 
we have not really proven this fact. The proof is straightforward. Consider the metric 
elements, expressed in pure covariant form as, 


Mi; = g; i g)- (14.88) 


The inner product between two vectors, expressed in two different coordinate systems, 
can be written 


A-B = Ai B/ M,; = A'" B'" M,.,, (14.89) 


where M/,, = &),: 1. The transformation equations can be used to express the primed 
vector components in terms of the unprimed components. This gives 


A’ BY Mj; = A’ BY tt"; Mjp. (14.90) 
Since this expression must work for all possible A and B, we must have 
Mi = 07 0"; Mins (14.91) 
which easily inverts to give 
Mi; = 871 8"; Mnn- (14.92) 


But this is exactly how the elements of a second-rank tensor must transform, and so 
by definition the metric is a tensor. This means that we can write 


M = M;; & @’. (14.93) 


Since the metric is a tensor, we can modify its covariant or contravariant nature 
as we would for any tensor. Although it might seem a little odd to use the metric to 
modify itself, we can change a purely covariant metric to a purely contravariant form 
by applying the metric twice: 


NON-ORTHONORMAL COORDINATE SYSTEMS 581 


M!! = M'"™ M™ Man. (14.94) 
We can also put the metric in mixed form by writing 
M', = M™ Mnj- (14.95) 
Using the transformation equations, you can easily show that 
M', = @ - 8; = 54). (14.96) 


This implies that the metric tensor is really just a generalization of the Kronecker-6 
tensor. 


14.2.7. The Contravariance and Covariance of Partial Derivatives 


When partial derivatives are taken with respect to a contravariant coordinate, the result 
is a covariant quantity. To see this, let the contravariant coordinates in an arbitrary 
pair of coordinate systems be x! and x''. The rules of calculus require that 


a _ oxi @ 


Blt = BM Bx” iad 


where there is an implied summation over the index j. But notice, the dx/ /dx!! term 
is exactly the definition of g’,. This lets us write 


é 


ox! 


te) 


uprated 
* axt 


(14.98) 
Comparing this expression with Equation 14.70 shows that the partial derivative 
operation transforms just like a covariant quantity. The same type of argument holds 
for partial derivatives with respect to covariant coordinates. In that case we find 


0 Ox; O 
ee See 14,99 
ox! = OX! Ax; 
= ge (14.100) 
Ox; 


which says that this partial derivative acts like a contravariant quantity. To be consis- 
tent with our superscript/subscript conventions, we impose the rule that a superscript 
in the “denominator” of the derivative operation acts like a subscript, while a sub- 
script in the denominator acts like a superscript. This idea was discussed briefly in 
connection with the transformation matrices in Equations 14.56 and 14.58. 


Example 14.3 A static electric field is often calculated by taking the gradient of a 
scalar potential: 


E = —-Vo. (14.101) 
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In an earlier chapter, we defined the gradient operator by the relation 
db =Vo- dF. (14.102) 
Since the displacement vector can be written 
d¥ = dx' 8;, (14.103) 
where dx’ is a contravariant quantity, it is clear the gradient of ¢ can be written as 
Vo = — #. (14.104) 


To check the validity of this expression, insert Equations 14.104 and 14.103 into the 
RHS of Equation 14.102 to obtain 


Vo-ar= Soe -dxg, 


9b | 5 gi 
~ oi dx! &, (14.105) 
_ Of ,; 
~ axi ue 
= dd. (14.106) 


When we write the components of the electric field as 
E; = -=: (14.107) 


they are covariant and must transform according to the relation 


Ej = g/Ej. (14.108) 


Example 14.4 A static magnetic field B can be calculated using Ampere’s law: 


par -B= pol. (14.109) 
c 


In this expression, J is the total current flowing through the closed path C. Taking 
d¥ to be a differential vector quantity with contravariant components, as given in 
Equation 14.103, the components of the magnetic field used in this integration should 
be written in covariant form, so that 


fae B; = pol. (14.110) 
Cc 


NON-ORTHONORMAL COORDINATE SYSTEMS 583 


14.2.8 Special Relativity 


Special relativity deals with the observation of events as seen from two different 
coordinate systems, one moving at constant velocity with respect to the other. For 
simplicity, we will limit most of this discussion to systems with one spatial dimension, 
as shown in Figure 14.7. The primed system is moving, relative to the unprimed 
system, with a velocity v, in the x-direction. Einstein’s favorite example of this 
situation was a railway car moving at a constant velocity on straight, flat tracks. A 
coordinate system fastened to the tracks, which he called the embankment, is the 
unprimed system and a system attached to the moving car is the primed system. 
These two coordinate systems form different reference frames for the observation of 
events. 

Special relativity treats time as another dimension, on an equal footing with the 
spatial coordinates. To make this work out, the time coordinate needs to have the 
same units as the spatial coordinate. This is accomplished by multiplying the time 
parameters by the speed of light c,. An event occurs at a specific set of values of the 
space-time coordinates. For these coordinates in the unprimed system, instead of x! 
and x2, we will use x and cot: 


xtox 


x? > Col. 


Even though we have not labeled these coordinates with superscripts, you should 
realize that they are actually contravariant quantities. In the primed system, this same 
event occurs at the primed coordinates: 


x!? — cot! 

The Lorentz Transformation In the previous section, we showed two different sets 
of coordinates could describe the same event. Now we ask, what is the transformation 
that converts one set of these coordinates to another? To answer this, we need to briefly 
discuss the fundamental assumption of special relativity. 

The basic postulate of special relativity is that the speed of light must be constant, 
regardless of the reference frame from which it is being measured. This is a strange 
idea, apparently fraught with paradox. It implies, for example, that the light coming 
out of the headlights of a moving car, as observed by someone standing on the 


Figure 14.7 The Inertial Systems for Special Relativity 
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ground outside the car, travels at the same speed, independent of the speed of the car. 
It also has other strange consequences, such as the dilation of time and contraction of 
lengths for fast-moving objects. Nevertheless, it has been confirmed by many different 
experiments. Our goal here is not to delve into these ideas and their associated 
paradoxes, but rather to show how the transformation between inertial frames can be 
looked at using a skewed coordinate system. For a complete introduction to the ideas 
of relativity, the reader should investigate the excellent book, Spacetime Physics, by 
Edwin Taylor and John Wheeler. 

What does the invariance of the speed of light say about the transformation matrix 
that converts the unprimed coordinates to the primed coordinates? In terms of the 
coordinates defined above, if an object travels at the speed of light, following the line 


xX = Col, (14.111) 
then the transformation matrix [*] must convert this into 
x' =cot', (14.112) 


so the object is still moving at the same speed in the primed coordinate system. The 
linear transformation that accomplishes this was worked out by Hendrick Lorentz, 
and bears his name. With one spatial dimension, the Lorentz transformation is given 
in matrix form as 


ti, te aa : 14.113 
ey, ES Yo ( ) 
where 
B = (14.114) 
Co 

and 

y= : (14.115) 

a he Be 


The inversion of the matrix in Equation 14.113 is straightforward and gives the 
elements of the [g] matrix as 


gi BA | (14.116) 


It should not be surprising that this matrix is simply the [t] matrix with the sign of v, 
reversed. 

The form of these transformations implies an important fact. Given the coordinates 
of an event observed in the unprimed frame (x, cot), the quantity x? — (c,t)* is invariant 
to transformation. That is, if we form this same quantity using the event’s primed 
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components, we find that 
x el) Sa let). (14.117) 


This statement is true for any x and c,t, as long as the primed and unprimed coordinates 
are related by the Lorentz transformation. This is easily shown by substituting 


x! = Yo X — YoBo Cot (14.118) 
and 
Cot! = —YoBo X + Yo Cot (14.119) 


into the RHS of Equation 14.117. 


Skewed Coordinate Representation There is a clever way, using a skewed co- 
ordinate system, to visualize the relationship between the primed and unprimed 
coordinates that is imposed by the Lorentz transformation. Imagine the unprimed 
coordinate axes are perpendicular, and introduce the orthonormal, covariant basis 
vectors @, and %,, as shown in Figure 14.8. The covariant basis vectors of the primed 
system can be determined using Equation 14.60 and the elements of the g’ j Matrix: 


[ér #2) =[& &] BA Beak (14.120) 
Notice we have arranged the basis vectors horizontally to be consistent with the fact 
that the summation is done over the first index of g’ ; - According to these equations, 
the primed basis vectors form a skewed set of coordinate axes, with an angle of 
6 = tan’! B, between the primed and corresponding unprimed axis. This is shown 
in Figure 14.8. As the relative velocity between the reference frames increases, 0 
increases and the prime system becomes more drastically skewed. 

This representation has several things going for it. An event, with coordinates 
(x, Cot) in the unprimed system and (x’, cot’) in the primed system, can be represented 
as a single point for both systems, as shown in Figure 14.9. Except for a small 
complication due to how the axes are scaled, the contravariant components in each 
coordinate system can be determined by axis-parallel projections of the point onto the 
appropriate axes. Notice, the line x = cot, called a light line, plots as the straight line 
at 45°. The same line could also be written as x’ = c,t', graphically demonstrating 
that the speed of light is invariant. 

If an object is observed in the unprimed frame, moving at constant velocity 
x = vt, the motion plots as a straight line in the two-dimensional coordinate systems. 
If v < cp, this line must lie above the light line, as shown in Figure 14.10. The same 
line describes the motion as observed in the primed system, and so the motion in this 


frame is still at constant velocity, x’ = v't'. Notice, however, that v' # v. We can 
y 
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Figure 14.8 The Two-Dimensional Lorentz Coordinate Systems 


Figure 14.9 Lorentz Transformations of Events 
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ct ; 


ct 
° 


Figure 14.10 Motion at a Velocity Less Than the Speed of Light 


determine the primed velocity from: 


Ax' 
! — 
way (14.121) 


This velocity can be calculated directly from the transformation equations as 


v' =¢, (2B) (14.122) 


Co — Bov 


As can be seen from this expression and from the figure, as v + Cy, v' > Co. 

Another classic consequence of special relativity is immediately evident from the 
graphical representation of coordinates in Figure 14.9. Two events which occur at 
the same time in the unprimed frame, occur at different times in the primed frame. 
Imagine two events with the same value of cot, but different values of x. They would 
occur on a horizontal line in Figure 14.9. In the primed system, not only will they 
have different values of x’, but they also will have different values of cot’. 

The representation of Figure 14.8 for describing the transformations of special 
relativity does have problems, however. First, consider the magnitude of the basis 
vectors of the primed system. These covariant basis vectors are specified in terms of 
the unprimed basis vectors by Equation 14.120. Specifically for g}: 


&; = Yo 81 + YoBo &2.- (14.123) 
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The magnitude squared of this basis vector is obtained by an inner product of the 
basis vector with itself: 


| = 8 - 8. 


= (Yo £1 + YoBo B2) * (Yo £1 + YoRo &2) 
= ya -f: B2), (14.124) 


lai 


where the last step follows because the unprimed basis vectors have unit magnitude. 
As long as the primed system is moving with respect to the unprimed system, that is 
if v. # 0, Equation 14.124 says that g{ and, by a similar argument §}, are not unit 
vectors. This in itself is not that serious a problem, because our formalism does easily 
handle nonunit basis vectors. It does, however, complicate the geometrical picture 
of Figure 14.8, because you cannot simply axis-parallel-project a vector onto a basis 
vector to obtain a vector component. You must also divide by the magnitude of the 
corresponding basis vector. 

A more serious problem arises when you consider the inner product in this system. 
In particular, let’s look at F - F, the inner product of the position vector with itself. 
This quantity is a scalar and should be invariant to transformations. In the unprimed 
frame, 


r=x =, + Cot £2, (14.125) 
so we expect the quantity r - r to be given by the expression 
x* + (cot). (14.126) 


But we already determined in Equation 14.117 that special relativity requires the 
quantity x? — (c,t)* to be invariant. It is not possible for both quantities to be 
invariant. The problem lies in our assumption about the geometry of the space. The 
value of ¥ - F is not given by Equation 14.126! This is discussed in the next section. 


The Metric of Special Relativity Both the problem with the magnitudes of the 
primed covariant basis vectors and the invariance of the magnitude of the position 
vector involve some assumptions about how to take an inner product. In both cases, 
we took the inner product of a vector with itself in the unprimed coordinate system 
assuming an Euclidian metric of the form 


1 0 
Mi; > E ; (14.127) 


In effect, this was the metric used in the calculation of Equation 14.126: 


FeF=[x cof] lo ql [ 


= x? + (cot). (14.128) 


NON-ORTHONORMAL COORDINATE SYSTEMS 589 


With a little reflection, it can be seen that the invariant quantity of Equation 14.117 
can be generated if we use an alternate metric: 


1 0 
My | ae (14.129) 


When this metric is used in the unprimed system, Equation 14.126 is modified to 


= x? ~ (cot. (14.130) 


How does this new metric affect the orthogonality and the magnitudes of the basis 
vectors? Using the metric of Equation 14.129 to form the inner product of % with 
itself gives 


= 1. (14.131) 
The g; basis vector is still a unit vector. The inner product between §, and 82 is 
in de 1 O 0 
= 0. (14.132) 


The covariant basis vectors of the unprimed frame are still orthogonal. But the 
magnitude of g> is 


=-1. (14.133) 


This says that, while |@2| = 1, & itself must be imaginary! Very odd. 
What about the basis vectors of the primed frame? Using the transformation 
equations given by Equation 14.120, we can write 


8) = Yo 81 + YoBo & (14.134) 
8 a Yo Bo §1 + Yo fo. (14.135) 


Using the unprimed metric, the magnitude of g; is determined by 


1 O Bs 
[¥o  YoBo | E nal ee 
=I, (14.136) 


A 


81° Bi 


i 
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So g| has now become a unit vector. The inner product between 1 and §} is 
ae 1 0 

&) 8) = [ Yo YoBo | E ey bag 

= 0. (14.137) 


Now the primed, covariant basis vectors of the skewed system are also orthogonal! 
Finally, the magnitude of 8} is determined by 


Ap 28 0 0 Bo 
&) = | YBo Yo | E Es 2 | 
= ~-1, (14.138) 


The §) basis vector is an imaginary unit vector, similar to &. 

The inner products in the primed system could have also been performed using 
Mj;, the metric of the primed system. Since the metric is a tensor, the primed metric 
is related to the unprimed metric in the normal manner: 


Mi; = 8°) 8") Mmn- (14.139) 


J} 


In matrix notation, the primed metric is given by 
[M’] = | Yo ee 1 0 | Yo es 
YoBo Yo 0-1 YoBo Yo 
_—{i Oo 
= E 4 , (14.140) 


Notice the form of the metric is identical in the primed and unprimed systems. With 
this metric, the inner product between the covariant primed basis vectors can be 
determined: 


fh =) 
ve 
\\ 
—_ 
— 
© 
I 
or 
Lo 
-— 
rel 
co 
ou 
| ee | 


=i. (14.141) 


it i= 
e 
ll 
— 
i=. 
fo) 
ed 
—a 
Co -=_ 
lo 
— 
| ee | 
: aie career | 
a) 
a 


= 0, (14.142) 


and 


=-1, (14.143) 


These are exactly the same results obtained previously, using the unprimed metric. 
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The fact that one of the basis vectors is imaginary seems like a very strange thing. 
But there is nothing mathematically inconsistent with it. The distance along the the 
axis is now simply measured in terms of “imaginary” meter sticks instead of “real” 
ones. The new metric, while not very intuitive, solves our problems with the simple 
picture in Figure 14.9. The basis vectors are now unit vectors and the invariance of 
x” — (cof) is built into the inner product operation. This weird geometry is often 
called Minkowski space, while the new metric is called the Minkowski metric. 

We actually can devise an equivalent coordinate system which does not use the 
imaginary time axes. In this construction, we write the position vector as 


F = x6, + icot &, (14.144) 


where the basis vectors é, and @2 are now real, orthonormal, unit vectors. The same is 
true in the primed system. In this case inner products are performed with the “normal” 
Euclidian metric. This is equivalent to our earlier construction, using the imaginary 
basis vector, since all we have really done here is pull out a factor of i from the basis 
vector and put it into the coordinate. The problem with this view is that you lose the 
convenient representation of Figure 14.9. There is no way to interlace the two sets 
of axes, represent the same event in both systems as a single point, and visualize the 
transformation with axis-parallel projections. The best we can do is draw two distinct 
coordinate systems, as shown in Figure 14.11. 


Extension to Three Dimensions These ideas can easily be extended to three spatial 
dimensions. The position vector, in the unprimed and primed systems, becomes 


P= x8 + y B +283 + Cot B 


lgl 


= x! Bi + yy! Bh + 2! BS + cot! By. (14.145) 


The Minkowski metric expands to 


[M] = (14.146) 


ooor 
ooro 
or OO 
oo Oo 


| ict ict’ 


, 


x i K 


Figure 14.11 A Second Picture of the Coordinate Systems for Special Relativity 
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The Lorentz transformation between the two systems becomes 


Yo 


an 0 
— YoBo 


0 0 —YoBo 

1 0 0 

01 0 (14.147) 
00 Yo 


In this expression, the relative motion between the coordinate systems is along the x 
direction. The appropriate transformation for an arbitrary direction of relative motion 
can be obtained by a rotational transformation of this matrix. 

The position vector, as written in Equation 14.145, is usually called the position 
“four-vector.” There are many other important four-vectors, some of which are de- 
veloped in the exercises at the end of this chapter. One of their more useful properties 
is that the inner product between any two four-vectors is invariant to coordinate 


transformation. 


EXERCISES FOR CHAPTER 14 


1. Consider the primed and unprimed coordinate systems shown below: 


2 


x’? 


Let the two systems have a common origin. Each system has a set of covariant 


basis vectors %), &2 and 1, &}. 


(a) Find the expressions for x! and x? in terms of x” and x’? and the inverse 


relations. 


(b) Identify the covariant basis vectors in the primed system by drawing their 


magnitudes and directions. 


(c) The vector V, expressed in the unprimed system, is given by 


V = 28 + &. 


What are the contravariant and covariant components of this vector in the 


primed system? 


(d) Using its contravariant and covariant components, determine the magnitude 
of the vector V by forming the inner product of the vector with itself. Show 
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that the same result is obtained if just the contravariant components are used 
with the appropriate metric. 


2. Consider the unprimed, two-dimensional Cartesian system and a primed, skewed 
system, as shown in the figure below: 


2 
i 2 


. 30° 


(a) Determine the set of equations that relate the unprimed coordinates to the 
primed coordinates. 


(b) Determine the elements of the transformation matrices ¢/ ; and gi ip 
(c) Determine the elements of the Mj; metric in the primed system, using the 
elements of the transformation matrix g’ ; and Equation 14.92. 
(d) Show that the elements of the metric in part (c) could also have been obtained 
by using Mj, = 8 - 8}. 
3. We have shown that the metric is a tensor. It can be expressed using purely 


contravariant components as M’/, or with purely covariant components as M;,, 
so that 


M = M,8'# = M86. 


(a) Evaluate the product M*M;,;. 
(b) Use the result for part (a) to simplify the following expressions: 
i. ViM™M jx. 
ii. V'MUM x. 
4. Show that the cross product A X B expressed in a skewed system can be written 
as 


Ai Bigk VE jk 
and determine v. 


5. Starting with Equation 14.92 and taking the unprimed system to be orthonormal, 


show that 
ox™ ox™ 
a 
mi ~ (Sa) (Si): 
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The definition of the metric for the primed system is Mj, = 8}-&;, which depends 
only on the properties of the primed system. The above equation, however, seem 
to also imply that the primed metric depends on the unprimed system. Explain 
this apparent contradiction. 


. Inthis problem, the ideas of contravariance and covariance will be used to analyze 


a two-dimensional polar system. Designate a two-dimensional, Cartesian system 
with coordinates (x, y) > (x!, x”) as the unprimed system. Let the polar system 
be primed, with coordinates (p, 0) —> (x”, x’). The relationships between the 
two sets of coordinates are 


xf = 4 /(x})2 ae (x2)2 
12 -1 3 


(a) Obtain the elements of the metric for the primed polar system, M; , using the 
expression Mj; = g',g', where g/, = dx! /ax"'. 
(b) Show for the metric you obtained in part (b) that 


/ al al 
M;, # 6; -@;, 


where the é/ are the traditional polar basis vectors (@,, €g) defined in Chap- 
ter 3. 

Using contravariant/covariant transformation techniques, obtain the con- 
travariant (#°, 2%) and covariant (&, 89) basis vectors of the polar system 
in terms of the Cartesian basis vectors (€,, €,). Compare these contravariant 
and covariant polar basis vectors to the traditional polar basis vectors (@,, €g). 
Pick a point in the first quadrant and plot é,, é,, €,, €9, &), and £9. 

(d) The displacement vector can be expressed using the covariant polar basis 

vectors as 


(c 


— 


dé = dx"g! = dp, + d0Bp, 


so that the contravariant components of the displacement vector in the polar 
system are simply dp and d@. What are the covariant components of the 
displacement vector in the polar system? 

(e) Using the contravariant and covariant components of the displacement vector 
in the polar system, form the inner product dF - dF. 


Show that if a set of primed coordinates (x, ct) are related to a set of unprimed 
coordinates (x’, ct’) by a Lorentz transformation, then 


x? —(ctyP = x'? - (ety. 


. This problem illustrates the differences between Euclidian geometry and Min- 


kowski geometry. First for the Minkowski geometry: 
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(a) Draw an orthonormal set of axes for an (x, ct)-system and plot the point 
x = 2cm, ct = 3cm. 

(b) On the same plot, draw the x’ and ct’ axes of a primed system, moving at 
velocity v = .58c with respect to the unprimed system. 

(c) Transform the coordinates of the point in part (a) into the primed system and 
indicate the values of (x’, ct’) on the axes of your drawing. 


Now for Euclidian geometry: 


(a) Start with a (x!, x”)-system that is orthonormal. Make a labeled sketch of 
the axes of this system and plot the point x! = 2cm, x? = 3cm. 

(b) Determine the g’ ; and the r'; transformation matrices that relate the (x!, x?)— 
system to a skewed primed system, where the x" and x’? axes lie in the same 
position as the x’ and ct’ axes of the above Minkowski system. 

(c) Transform the point in part (a) into the primed coordinate system and indicate 
the values of (x", x’) on the axes of your drawing. 

(d) Notice and comment on the differences in scaling of the x’ and ct’ axes of 
the Minkowski system and the scaling of the primed axes of the Euclidian 
system. 


9. Consider a wave propagating along the x-axis with the a space-time dependence 
of 


eile tar) 


The time period of the wave is T = 22/w, and the spatial wavelength is 

A = 2a/k. 

(a) Assume that w and k are positive real numbers. Is the wave traveling in the 
positive or negative x-direction? 

(b) Now observe this wave from a coordinate system moving along the x- 
direction, with a positive velocity v = Bc with respect to the original 
system. Should the frequency of the wave in the moving system be larger or 
smaller than w? 

(c) Call the moving frame the primed frame. In this system, the space-time 
dependence of the wave becomes 


i(k! x!+ e's! 
ée , 


The primed quantities can be obtained from the unprimed quantities by using 
the transformation techniques of special relativity. First, it is necessary to 
construct a pair of “four-vectors” (actually, since we have only one spatial 
dimension in this problem, they are “two-vectors’’). The terms in the expo- 
nential brackets can be expressed as the inner product between two of these 
vectors, R and K. R is the position two-vector, 


R = x, + ctd, 
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while K is a frequency two-vector with the components 
K = kg! + (w/c)g?. 


Why are these k and w/c components covariant quantities? 
(d) With these two-vectors defined, their inner product is 


K-R=k + at, 
which is invariant to transformation. Since we are working in Minkowski 


space, why wasn’t the metric used to form this inner product? 
(e) The invariance of the quantity in part (d) says that 


kx + wt = k'x! + w't’. 


What is the physical interpretation of this invariance? 

(f) Use the Lorentz transformations to obtain expressions for k’ and w'/c in 
terms of k and w/c and identify the relativistic Doppler shift of the frequency 
and the Lorentz contraction for the wavelength as observed in the moving 
frame. Does this agree with your intuitive answer to part (b)? 

(g) Show that if 8 < 1 your answer to part (f) goes to the classical limit 


wo’ =at+kv 
ki =k. 


10. Why does Equation 14.122 imply that you cannot accelerate an object past the 
speed of light? 


15 


INTRODUCTION TO GROUP THEORY 


Group theory provides a formal framework for taking advantage of the different types 
of symmetries that pervade science. In this chapter, we introduce the mathematical 
definition of a group, and present several examples of both discrete and continuous 
groups that occur in physics and chemistry. 


15.1 THE DEFINITION OF A GROUP 


A group is a set of abstract elements which can be “multiplied” together. In this 
context, multiplication does not necessarily refer to the common operation between 
real or complex numbers, but rather a more generalized operation which combines 
two elements of the group together. To determine if something is actually a group, 
you must first know both the group elements and the rule for multiplication. 

We will typically denote an entire group using a capital letter, while the distinct 
elements that make up the group are written with a small letter and a subscript. For 
example, the group G might have the elements g), 22, 93,-..,8,. To talk about a 
single, but arbitrary group element, we will often use a letter subscript. For example, 
g; refers to any one of the elements 1, 92,23, ... 

In order for G to be called a group, its elements must obey the following four 
rules: 


1. If g; and g; are elements of G, then their product, g, = g; gj, is also an element 
of G. 

2. The multiplication operation is associative: (g; 2;)g, = gi(gj 8x)- 

3. One element / serves as an identity element, such that J g; = g; 1 = g; for all 
8i in G. 
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4. Each element of G has an inverse, g; = 2; 1 that is also an element of G. The 
product of g; and its inverse equals the identity element: 9; g; T= ple =]. 


Keep in mind that rule #2 does not require that the multiplication be commutative, 
although it may be. That is, g; g; does not have to equal g; g;. If the multiplication is 
commutative for all combinations of group elements, the group is called Abelian. 

Where possible, we will identify the first group element as the identity element, 
whose existence is required by rule #3. That is, g; = J. Notice all operations with 
this particular element are necessarily commutative. 


15.2 FINITE GROUPS AND THEIR REPRESENTATIONS 


In this section we present several examples of groups. As we progress, we will 
gradually accumulate an arsenal of terminology to keep track of the different types 
of groups and their properties. 


15.2.1 The Cyclic Group C, 


Our first example of a group consists of the elements 
G={l, 82, 83, 8at. (15.1) 


The order of a group, which we will call h, is the number of distinct elements it 
contains. The group described in Equation 15.1 has h = 4. If h is finite, as it is in 
this case, the elements form a finite group and the elements are discrete. In contrast, 
infinite groups have h — 0, Infinite groups can have discrete or continuous elements. 
In this chapter, we will limit our discussions to discrete, finite groups and continuous, 
infinite groups. 

The multiplication properties of a finite group can be summarized concisely using 
a multiplication table. The row label of a table identifies the first element in the 
product, while the column label represents the second. If we always make the first 
group element the identity, it is trivial to construct the first row and column of any 
table, as shown for our four-element group in Table 15.1. The first row and column 
of all multiplication tables, for groups of the same order, will be identical. The rest of 
the table depends upon the particular group. For the C4 group, the multiplication table 
is filled in with a cyclic arrangement of the group elements, as shown in Table 15.2. 


TABLE 15.1. Start of a Multiplication Table 
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TABLE 15.2. Multiplication Table for C4 


Notice this table is symmetric about the diagonal and therefore is Abelian. A good 
exercise would be to convince yourself that this table satisfies all the requirements of 
a group. 

Notice that each group element appears exactly once in each row and column of 
Table 15.2. This is not a coincidence, but rather a fundamental property of all group 
multiplication tables. To see this, assume the products of a group element g; with two 
different group elements, g; and g;,, produce the same element g,,. In other words, 


8i 8j = 8m (15.2) 
8i 8k = 8m- (15.3) 


If this were the case, g,, would appear more than once in the i* row of the table. If 
we multiply both these equations by g;!, we get the pair of equations 


8) = 8 8m (15.4) 
Bk = 8) | Bm (15.5) 

which imply 
&j = &k- (15.6) 


This clearly violates our initial assumption that g; # g,. This proof shows that an 
element cannot appear twice in a single row. It is trivial to construct a similar proof for 
columns. The combination of both results is often called the rearrangement lemma, 
because each row and column of a table must contain all the group elements, but with 
different ordering. 

Figure 15.1 shows a simple two-dimensional picture that can be associated with 
the C4 group. Each group element is a rigid rotation of the object in the plane of the 
figure, which leaves the square array in the same physical configuration it was initially. 
These operations are sometimes called symmetry operations. A rigid rotation requires 
the relative distances between the individual objects that make up the body remain 
constant. In the case of the square object of Figure 15.1, imagine sticks between 
the four objects in the corners of the square. The / element clearly does nothing. 
The other elements are integer multiples of 7/2 rotations of the figure around its 
center. The group multiplication table shows how two successive symmetry operations 
are equivalent to another symmetry operation. For example, two 7/2 rotations are 
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c ] 
g,79=0 
g,7 0=n/2 


g,70=n 
8, 9=3n/2 


Figure 15.1 A Graphical Interpretation of the C, Elements 


equivalent to a single 7 rotation. With this picture, the origin of the term cyclic should 
be a little more obvious. 

A representation of a group is a mapping between the group elements and a 
particular set of quantities, which obey the same multiplication table. The most useful 
type of representations use matrices as the group elements, with the multiplication 
between elements accomplished using standard matrix multiplication rules. Any finite 
group has an infinite number of different matrix representations, but as we will see 
later in the chapter, a finite number of “important” ones. 

Our picture of the rotations of the square array in Figure 15.1 immediately leads 
to a simple 1 X 1 representation of the C, group. We can associate four complex 
phasors with the C, elements, as shown in Table 15.3. Using this representation for 
the elements of C4, the multiplication table fills in as shown in Table 15.4. Notice 
how this table has exactly the same structure as Table 15.2. If we wished, we could 
even drop the e’® notation and write the elements of this representation as shown in 
Table 15.5. This does not change the multiplication table in any way. 

A different representation for the C4 group can also be devised by using Figure 
15.1 as a guide for constructing a set of two-dimensional transformation matrices. 


TABLE 15.3. A Representation for C4 
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TABLE 15.5. Another Form of the C, Representation 


Figure 15.2 7/2 Rotation for gz of the C, Group 


Since the go element corresponds to a counterclockwise rotation of 77/2, its effect on 
a two-dimensional orthonormal coordinate system is shown in Figure 15.2. The 2 x 2 
transformation matrix that relates the primed coordinates to the unprimed coordinates 


for this rotation is simply 
x’| | O ffx 
ie i BE (5.7) 


By associating the other group elements with their transformation matrices in a 
similar manner, we can generate another valid representation of Cy. The elements of 
this 2 X 2 matrix representation for C, are shown in Table 15.6. It is easy to check that 
the original multiplication table is still satisfied by multiplying the matrices together. 

It is clear from Figure 15.1, that we can define a cyclic group C;, of arbitrary order 
h, by increasing the number of sides of the polygon. The representations discussed 
here are also easily extended in the same manner. 


15.2.2 The Vierergruppe D2 


The only other possible group of order four has the multiplication table shown in 
Table 15.7. This group is called the vierergruppe and does not demonstrate the cyclic 
nature of C4. Instead, each element is its own inverse, and the product of any two 


TABLE 15.6. A Different Representation for C, 
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TABLE 15.7. The Vierergruppe Multiplication Table 


of the elements, excluding the identity, generates the third. The symmetry about the 
diagonal still exists, so the vierergruppe is Abelian. 

As before, a simple physical picture can be associated with this group. Imagine 
an object, perhaps a molecule, made of three pairs of different atoms. The six atoms 
are arranged as shown in Figure 15.3, Each element of the vierergruppe can be 
associated with a rigid rotation of the system which leaves it in an identical state. 
Clearly, rotations of a around any of the axes are group elements. The fourth element 
is, of course, the identity. We write these elements as R(O), R,.(a), Ry(z), and R,(77). 
We will often make use of this “R” notation for rotations. The subscript refers to 
the axis of rotation, while the argument is the angle of rotation, counterclockwise 
about that axis. The correspondence between the vierergruppe multiplication table 
and this picture needs to be shown. Since applying a 7 rotation twice is the same 
as no rotation at all, the fact that each element is its own inverse is obvious. That 
the product of any two nonidentity elements generates the third is demonstrated by 
Figure 15.4 for the R,(7r) Ry() = R,(7) operation. The other permutations follow 
in an analogous manner. Because of this picture, the vierergruppe is often called Dy. 
The D stands for dihedral, a figure formed by the intersection of two planes, and the 
2 indicates there is a twofold symmetry around each axis. 

We can easily construct a 3 X 3 matrix representation of D2 by considering the 
effect of the symmetry operations on the coordinate axes shown in Figure 15.3. 


Figure 15.3 Symmetry for the Vierergruppe 
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Rn) 
xX xX 
2o= ——> —_ meres 
ye 
Z 
| R(n) R,(n) | 
Z Z 
xX xX 
y = y 


Figure 15.4 Demonstration of Vierergruppe Element Multiplication 


For example, consider the group element R,(77). Figure 15.5 shows its effect on the 
coordinate system. The matrix that converts the unprimed coordinates to the primed 
coordinates in this picture is 


x! 1 O 0 x 
yt=lo0 -1 0 | ly}. (15.8) 
z! 0 0 -!l Zz 


The matrices for the remaining elements can be generated in a similar manner, and 
the complete representation is given in Table 15.8. 


Figure 15.5 Vierergruppe Symmetry Rotation About the x-Axis 
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TABLE 15.8. A 3 X 3 Representation for the Vierergruppe 


Figure 15.6 Object for Threefold Symmetry 


15.2.3 The Threefold Symmetry Group D3 


Figure 15.6 shows an object with a threefold symmetry. Three identical objects are 
placed at the corners of an equilateral triangle situated in the xy-plane. The coordinates 
of each point are labeled in this figure. 

What are the rigid rotations that leave this object unchanged? If we restrict our- 
selves to rotations in the xy-plane, the transformations constitute the C3 group, a 
version of the cyclic group we investigated earlier. Using the “R” notation, the el- 
ements are R,(0), R,(277/3), and R,(47/3). We can write down a simple matrix 
representation of these elements, by determining the 2 < 2 transformation matrices 
associated with each rotation, the C3 version of the representation given in Table 
15.6. The result is given in Table 15.9. 


TABLE 15.9. The Three Elements of C; and a Representation 


R(22/3) R,(42/3) 


Ex 8 |e | 


J3/2 ~1/2 —/3/2  -1/2 
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Figure 15.7 Axes for Interchanging Two of the Threefold Symmetric Objects 


TABLE 15.10. The Six Group Elements of a D3 Representation 


There are other rigid body transformations that leave the configuration of this 
object unchanged. If rotations in three dimensions are allowed, there are three more 
symmetry operations that interchange only two vertices of the triangle. These can be 
viewed as rotations of a about the D, E, and F axes, shown in Figure 15.7. Now a 
total of six group elements have been generated, which are listed along with their 
matrix representation in Table 15.10. 

To finish the story, we need to confirm that these elements form a complete group, 
and determine the multiplication table. This can be done using pictures similar to 
Figure 15.4 or by multiplying each possible pair of the matrix representations together. 
With either method, the result is that these six elements do indeed form a complete 
group, and the multiplication table is given in Table 15.11. This group is called D3. 
Notice that it is not Abelian. 


TABLE 15.11. The Multiplication Table for D3 
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Figure 15.8 Labeling for Operator Representation of the D;.Group Elements 


15.2.4 Operator Representation and Permutation Groups 


Another useful type of group representation has operators as the group elements. 
As an example of this type of representation, consider the D3 group of the previous 
section. To develop the operator elements of this group we label the objects at the 
comers of the triangle with the letters a, b, and c, as shown in Figure 15.8. The 
82 Operation takes the sequence (abc) to the sequence (cab), while the g4 element 
changes the sequence (abc) to (acb). We can define a “rearrangement” operator [132] 
such that 


[132](abc) — (acb). (15.9) 


What this notation means is that the first value between the initial parentheses gets 
mapped onto the first value between the final parentheses. The second and third values 
are interchanged. All of the elements of the D3 group can be expressed as a set of 
these rearrangement operators as described by Table 15.12. 

Multiplications for the elements of D3 are easily performed with these operators. 
The product of the two elements 


82 84 = &5 (15.10) 
TABLE 15.12. Operator Representation for D3 


[312] [231] [132] [213} 
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is simply written as 
[312][{132] = [213]. (15.11) 


Notice how the product of two elements is now effectively built into the notation. 
We do not need to reference a multiplication table to determine the result given in 
Equation 15.1], because the first operator [312] applied directly to [132] gives the 
result [213]. 

The operator representation is very useful for discussing the general class of 
permutation groups. The S,, permutation group describes the indistinguishable ways 
that n identical objects can be arranged. The identity element of this group can be 
represented by the operator [1234 - - - n], while the other elements have operators that 
are generated by all the possible permutations of the numbers | through n. The S, 
group is therefore of order n!. 


15.3. SUBGROUPS, COSETS, CLASS, AND CHARACTER 


There are several ways of sorting the elements of a group into categories. This section 
describes the four most important classifications for group theory: subgroups, cosets, 
class, and character. 


15.3.1 Subgroups 


A subgroup is a subset of the elements of a group that, by themselves, form a 
legitimate group. The D3 group, whose multiplication table was given in Table 15.11, 
has several subgroups. One, consisting of the elements /, 22, and g3, is easy to spot in 
the upper left-hand corner of the table, and its multiplication table is shown in Table 
15.13. There are three other subgroups that consist of two elements each; a subgroup 
formed by J and g4, one from / and gs, and the last from J and gg. Each of these two 
element subgroups satisfies a multiplication table similar to the one shown in Table 
15.14.Notice the identity element must be a part of any subgroup. Actually, if we 


TABLE 15.13. Multiplication Table for a Subgroup 
of D3; 


82 


TABLE 15.14. Another Multiplication Table for a 
Subgroup of D; 


f ° 
I 


84 84 I 
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want to be rigorous, D3 has two additional “trivial” subgroups. First, the entire group 
itself is considered a subgroup. Also, the identity element all by itself is a subgroup. 
These trivial cases occur for all groups, and are usually ignored. 


15.3.2 Cosets 


A coset is the set of elements formed by multiplying all the elements of a subgroup 
by an element of the group. Let the group G have order h and consist of the elements 
21,22) °° ° &n- For the purposes of this discussion we will use the symbol G to represent 
all the elements of the group, in any sequence: 


G = {g1, 82, 83, °°" ent = (23, 8h, 81, “°° 82} = etc. (15.12) 
This convention lets us write 
giG =G, (15.13) 


which is just a compact way of writing the rearrangement lemma. Assume G has a 
subgroup S of order h’, with elements 


S = {s1, 52, 53, +7* Sp}. (15.14) 


Let g, be an element of G that may or may not be in S. If we premultiply g, by all 
the different members of S, we get a set of h’ distinct elements called a right coset 
of S. We write the collection of products as S g,: 


S gy = {81 Bx, $2 Bx, 53 8x, °°" Shi Bxh- (15.15) 


Notice that this set of elements does not necessarily form a group. If g, is in S, then 
the coset is equivalent to the group S. If g, is not in S, every element of the coset must 
be an element that is not in S, and the coset cannot form a group because the identity 
element is missing. There are h possible choices for g,, so there are h possible right 
cosets of S. If instead, we put g,. first in the product operation, we get a left coset: 


8x S = {gx 51, Bx $2, 8x53, °° ° Bx Swe (15.16) 


Again, there are h possible left cosets of S. 

It should be noted that some texts define cosets with the additional requirement 
that g, not be an element of S. The distinction is not very important, since it only 
eliminates the cosets that are equivalent to S. Our choice for forming the cosets helps 
to clarify the following discussion. 

The right and left cosets of S are not necessarily distinct. An important theorem 
says that the elements of two cosets of S either have all the same elements, or no 
elements in common. To see this, consider two right cosets, Sg, and Sg,. Now imagine 
these two cosets have an element in common. That is, for some choice of s; and s;, 


Si Bx = Sj By. (15.17) 
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Premultiplying this expression by s;' and postmultiplying by gy! gives 


Sj! 5; = By 8;'. (15.18) 


The element generated on the LHS of this equation is an element of S, so the product 
&y g, | on the RHS must be an expression for this same element of S. Thus we can 
write 


Sgyg,'=S, (15.19) 
or equivalently 
S gy =S gy. (15.20) 


Consequently, if the same element appears in two different cosets, the two cosets 
have all their elements in common. Conversely, if a particular element is present in 
One coset but not another coset, there can be no common elements between these two 
cosets. 

The elements in any single coset are distinct. That is, no element can appear more 
than once in any given coset. This is a direct result of the rearrangement lemma— 
every element must appear once and only once in every row and column of the 
table. Therefore, every coset contains exactly h’ different group elements. Therefore, 
because any two cosets will either have all elements in common or no elements in 
common, the order of the group h must be an integer multiple of the number of 
elements in any of its subgroups. That is, 


= nh’ where n = a positive integer. (15.21) 


This is demonstrated quite nicely by the subgroups of D3, discussed earlier. The D3 
group has six elements, while its nontrivial subgroups have orders two and three. 


Example 15.1 As an example, consider the right cosets formed from one of the 
subgroups of D3. Remember D; consists of six elements, which obey the multiplica- 
tion Table 15.11. Earlier we found a subgroup formed from the first three elements 
of this main group: 


S = {I, g2,83}- (15.22) 


There are six right cosets associated with this subgroup, each with three elements, as 
shown in Table 15.15. The first three cosets are formed by multiplying the subgroup 


TABLE 15.15. Right Cosets of a Subgroup of D3; 


S 81 S g2 S g3 S g4 S gs S 86 


II=I T g2 = 82 T 93 = g3 T g4 = & T a5 = 85 T 86 = & 
& I= §2 82 = 83 82 gal 82 84 = 8&5 82 85 = & 82 86 = 84 
83 1 = g3 83 g2 =1 83 8&3 = 82 83 84 = 86 83 85 = 84 83 86 = 85 
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by an element in the subgroup. These cosets therefore always contain the same three 
elements as the subgroup S. The last three cosets are formed by products of S$ with 
group elements that are not in S. Each of these cosets contain three distinct elements 
of G that are not in S. 


15.3.3 Class and Character 


Another way to sort group elements is by their class. If a and b are elements of a 
group, they are in the same class if we can find another group element g such that 


giag=b. (15.23) 


This type of process, which involves an operator and its inverse acting on a, is called 
a similarity transform of a. Notice that the identity element must always be in a 
class by itself, because any similarity transform of the identity element will always 
generate the identity element. 

In matrix algebra, the trace of a square matrix is the sum of all its diagonal 
elements. In subscript notation, it can be written 


Tr[b} = bi, (15.24) 


where the doubled subscript implies a sum over the index i. It is easy to prove the 
matrix representations of all the group elements in one class must have the same 
trace. Suppose [a] and [5] are two matrix elements of a group, related by a similarity 
transform using group matrix elements [g] and [g]!, in the matrix form of Equation 
15.23, 


[g)‘fallg] = [6], (15.25) 
or in subscript notation: 

837 4 n8e = Di (15.26) 

The trace of [b] becomes 
bit = 8);'QjeBki (15.27) 
= 8ki8))' Ox (15.28) 
= yj jk (15.29) 
= ay. (15.30) 


Thus [a] and [b] must have the same trace if they are in the same class. 

This property turns out to be so important that the trace of a matrix representing a 
group element is given its own special name. In group theory, it is called the character 
of the element, and in this text is denoted as y. We have shown that matrix elements 
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representing group elements of the same class will always have the same character. 
Keep in mind this does not always work the other way around. It is possible to have 
two matrix elements that are in different classes, but have the same character. 


Example 15.2. The group D3 has three different classes. As always, the identity 
element is in a class by itself. The two elements g, and g3 are in a second class, 
while the elements g4, g5, and ge form a third class. This can be easily confirmed 
by performing the similarity transformations. If you look back at Figure 15.6, it can 
be seen that these three different classes for D3; correspond to three very different 
types of symmetry operations. The identity element is unique because it is the only 
operation that represents no change to the orientation. The elements g2 and g3 are 
both operations which rotate the system in Figure 15.6 in the plane of the page, 
interchanging all three of the vertices of the triangle. The elements g4, gs and g¢6 
are operations where only two of the objects are interchanged, while a third is held 
fixed. This gives an intuitive feel for what a class really is. It is a collection of all the 
symmetry operations which do “the same type of thing.” 

A list of the characters for a 2 X 2 matrix representation for D3 are shown in Table 
15.16. The first column of this table contains the 2 X 2 matrix representation of the 
group elements we found in Table 15.10. The second column indicates the character 


TABLE 15.16. List of Characters for a D3 
Representation 
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of each matrix element. Notice how we have subdivided the elements by their class. 
For this representation, each class has a different character, but remember this will 
not always be the case. 


15.4 IRREDUCIBLE MATRIX REPRESENTATIONS 


This section introduces irreducible representations, an enumerable set of important 
representations which exist for any finite group. In analogy with the basis vectors of 
a linear space, these irreducible representations are orthogonal and complete. This 
means, among other things, that any representation can be written as a sum of the 
irreducible representations, a fact that has great consequences in the application of 
group theory to physical problems. 

The derivation of many of the results of this section, including the comments in 
the previous paragraph, are too involved to present in this short introduction to group 
theory. In addition, there are many facets of the subject that we will not discuss. 
Only a few important results are presented here. The complete story, along with 
formal proofs, can be found in many textbooks devoted solely to group theory. An 
introductory, but complete treatise of group representation theory can be found in the 
third chapter of the book by Wu-Ki Tung. The books by Hamermesh and Wigner also 
give a very complete discussion of group theory. 


15.4.1 Notation 


Because there are many possible matrix representations for a group, we need a 
notation for distinguishing them. When we discussed the group C4, we explored the 
two different representations given in Table 15.3 and Table 15.6. We will call the 
first C’?) and the second c/41, as shown in Table 15.17. The brackets are used as a 
reminder that this notation refers to matrix representations only. You will discover 
the justification for labeling some representations with bracketed numbers and some 
with bracketed letters in the sections that follow. Also shown in Table 15.17 is the 
representation cy, a one-dimensional representation made up of all 1’s. This trivial 
representation will work for any group, and we will always reserve the bracketed 
index "I for it. 


TABLE 15.17. Three C, Representations 
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TABLE 15.18, The Direct Sum of Two C, Representations 


i I &2 83 84 


ce 1 i es = 


1 0 
a) [oo 
0 
0 
1 


1 
CP = CP} ce f 
0 


To refer to a single element in a particular matrix representation, we use a modi- 
fication of this notation. For example, the Cc matrix representation of the g3 group 
element is written as oll 1 If we need to eles to one arbitrary element, we use a 
letter as the subscript. For example, git | refers to any one of the matrices in the Cy iA) 
representation. 


15.4.2 Direct Sums of Matrix Representations 


The matrix representation of a group can be of any dimension. In fact, given any 
matrix representation of a group, it is easy to expand it to a higher dimension by 
adding “blocks” of another representation. This process can be demonstrated using 
the C, cyclic group. Consider the co and the cA representations shown in the first 
two rows of Table 15.18. The Cl”! representation can be looked at as a set of 1 X 1 
matrices. They can be added to the on matrices in a block diagonal sense to form 
the new 3 X 3 matrix representation listed in the last row of Table 15.18. The © 
symbol, called the direct sum operation, is used to indicate this type of block diagonal 
addition: 


cP! 0 
cl = ®t = sil (15.31) 
4 


It is easy to see that the set of ce @ co” matrices form a valid representation 
for the C4 group. When any two of these elements are multiplied together, the 
block diagonal form insures there is no mixing between blocks. Since each block 
independently satisfies the C, group multiplication table, the direct sum must also 
satisfy the same table. 


15.4.3 Reducible and Irreducible Representations 


Given a particular matrix representation G! for a group G, 


Gal = { si, gil, gldl,... ae (15.32) 
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we can generate another representation G¥! by simply transforming the coordinate 
basis. If [T] is the transformation matrix, then the second representation is related to 
the first via the relations 


gil = (ry g!I[r). (15.33) 


Keep in mind, the same transformation matrix [7] must be applied to all the gif) 
elements. It is obvious that both representations are of the same dimension and obey 
the same multiplication table. Notice Equation 15.33 is in the form of a similarity 
transformation, like that of Equation 15.23, so the matrices of the new representation 
will have the same character as the original representation. 

Two matrix representations are said to be equivalent if you can obtain one from the 
other using the type of operation described in Equation 15.33. In contrast, inequivalent 
representations cannot be generated from one another by simply transforming the 
coordinate basis. Obviously, two representations of different dimension will always 
be inequivalent, but it is also possible to have two inequivalent representations of the 
same dimension. 

If, using the same transformation matrix, all elements of a representation can be put 
into the form of a direct sum of two or more smaller blocks, the original representation 
is said to be reducible. If it cannot, it is irreducible. The irreducible representations 
of a group are important because any representation can be transformed to a direct 
sum of one or more irreducible representations. We will discuss this in more detail, 
later in this chapter. 

It turns out that the number of inequivalent irreducible representations is not 
infinite, but rather equal to the number of different classes in the group. In order to 
distinguish these enumerable irreducible representations from the infinite possible 
reducible ones, we use a slightly different notation for the two. If a representation is 
reducible, we use a letter in the brackets. For example, the reducible representation 
of C4 we developed in Equation 15.31 was called C!?!, The representation C!?! is 
obviously irreducible (because it is one-dimensional) and so is labeled with a number. 


Example 15.3 As a simple example of the reduction of a representation, consider 
the cil representation of the C4 cyclic group, which was shown in Table 15.17. 
We would like to know if, using some similarity transformation, can we write this 
representation as the direct sum of two 1 X 1 matrices. In other words, is this 
representation reducible? 

Since these matrices are two-dimensional, to reduce them any further is equivalent 
to diagonalizing them. In Chapter 4, we discussed a general method for finding the 
transformation matrix which diagonalizes a given matrix. Let’s apply this method to 
find the matrix [JT] which diagonalizes the oh) matrix element of ox. First, we find 
the eigenvalues using the determinant equation 

-A 1 
| oq) ay | = 0, (15.34) 
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TABLE 15.19. Block Diagonalization of C'*! 


which gives 
a (15.35) 


These eigenvalues generate the two eigenvectors 


1 1 1 1 
—). 5; aie (15.36) 
We | i J/2 Z| 
From these we can construct the transformation matrix 
1 
rye E zl (15.37) 
and its inverse 


Tie E ‘4 ; (15.38) 


Because we constructed it specifically to do so, applying [7] and [T]~! to the 
gif! matrix in a similarity transformation will certainly diagonalize it. The real test, 
however, is if the same transformation simultaneously diagonalizes all the other 
elements. Table 15.19 shows the result when the same similarity transformation is 
applied to the rest of the matrix elements. Notice, these elements are also diagonalized, 


so C\“1 is reducible and can be written as the direct sum 
ce ch, (15.39) 


where Ci”! is the new representation shown in Table 15.20. Since Cl?! is one- 
dimensional, and is obviously different from cs and cn; it must be a third ir- 
reducible representation of C4. 

Earlier we stated that the number of irreducible representations is equal to the 
number of classes. The number of classes of C, is four, but so far we have only 
identified three different irreducible representations. One way to discover the fourth 


TABLE 15.20. Another Irreducible Representation for 
Cs 
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set would be by trial and error. A more educational method is to derive it using the 
orthogonality relations of the irreducible representations, a topic we explore later in 
this chapter. 


Example 15.4 As another example of representation reduction, consider the D3 
group. We showed a 2 X 2 matrix representation for this group in Table 15.10. An 
attempt to diagonalize this representation in the same way we did for the previous 
example fails. Therefore, it is an irreducible representation, which we will call Dy, 
As always, pi) is the trivial one-dimensional irreducible representation. 

A3X3 matak representation for this group can be constructed from the operator 
representation developed in Table 15.12. For example, take the [231] operator. It 
generates the sequence (bca) from the sequence (abc). This operation can also be 
accomplished by a3 X 3 matrix 


01 0 a 
00 1 bl = fc]. (15.40) 
1 0 0 c 


This matrix is a valid representation for the g3 element of D3. The matrices for all 
the other group elements are constructed in a similar manner. The result is shown in 
Table 15.21 and we will refer to this as the DM representation. Now we ask, is this 
representation reducible? 

We can attempt to reduce this matrix by a similar process to the one used earlier. 
We will diagonalize one of the group elements, and then check the other elements to 
see if the same transformation puts them into a reduced form. We have no guarantee 
that this will work in this case, because there is the possibility that none of the 
elements (besides the identity) are diagonal in the reduced representation. Later in 
the text, we will revisit this problem, using a more reliable technique based on the 
orthogonality and completeness properties of the irreducible representations. 

We have a small hint from looking back at the 2 x 2 bY representation. Its g¢ 
element is diagonalized. Since the DPI matrices are an ‘reducible representation, it 
would make sense to attempt diagonalization of the same element in the 3 X 3 Di) 
representation, and see what happens. Luckily, this process actually works, and the 
transformed version of the set of pi) matrices is given in Table 15.22. Notice this is 
just the direct sum D!"! @ D#!. 


TABLE 15.21. The 3 x 3 Di"! Representation 
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TABLE 15.22. Block Diagonalized of the D“) Representation 


We already determined that D3 has three different classes. This means that there 
should be three irreducible representations. So far, only two have been identified, D}!! 
and DG). In the next section, we determine the remaining irreducible representation 
D?!, using the orthogonality of the irreducible representations. 


15.4.4 Orthogonality of Irreducible Representations 


There are several important rules that the irreducible matrix representations obey: 


1. The number of irreducible matrix representations for a group is equal to the 
number of different classes of the group. 


2. If nl"! is the dimension of the first irreducible matrix representation, n! the 
dimension of the second, and so on, then 


(nll? + (nly? a (ny? +--+ =h, (15.41) 


where the summation is over all the irreducible representations and h is the 
order of the group. 


3. “Vectors” formed by the elements of the irreducible representation matrices are 
orthogonal. In this context, a vector is formed by combining the elements of 
a representation in a very particular way. The components of one such vector, 
for example, are formed by all the 1-2 matrix elements of the six Dy matrices. 
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The components of another are formed by all the 2-2 elements of the six DY! 
matrices, etc. 

4. The magnitude squared of each of the “vectors” described in rule #3 is equal to 
h/n'4 where n® is the dimension of the matrix representation used to construct 
the vector. 


When checking orthogonality or checking the magnitude of the vectors described 
in rules #3 and #4, be aware that, if any of the components are complex, one set 
of the vector components should be complex conjugated before performing any dot 
products. 

In both the examples of the previous section, we found there was one missing 
irreducible representation. Instead of using guesswork to find the last representation, 
it is instructive to use the above rules to determine it. 


Example 15.5 For the C, cyclic group, three irreducible representations were iden- 

tified: cl C an and C. ee There are four different classes in this group, and according 

to rule #1, there must be another irreducible representation which we will call cH ; 
First let’s determine the dimension of the C!"! matrices. According to rule #2: 


(nly? + (nl2h2 + (nBly? + lly? = 4 (15.42) 
or 
pelea (15.44) 


So the final irreducible representation for the C, group is composed of four 1 X 1 
matrices. The identity element must be 1, and according to the multiplication table 
for C4, g3 is its own inverse, so gf = +1. Also, gq is the inverse of g2, so if we let 
gh! = a, then gi] = 1/a, Notice we have given a the flexibility of being complex. 
This information leads us to the partially specified Table 15.23. Now we just need to 
determine the value of a and the sign of 2"). 

A table of all of the four irreducible representations for C4 is shown in Table 15.24. 
Notice all the representations are 1 X 1 matrices. Incidentally, this turns out to be a 
general property of any Abelian group. Following the prescription described in rule 


#3, we form four, four-dimensional vectors, which we call Vi! ¥!, V"), and ve 


TABLE 15.23. The Form of the Missing Irreducible 
Representation of C, 
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TABLE 15.24. Partially Specified Table of the 
Irreducible Representations of C, 


Vis vi_| 2 (15.45) 


i . 
—— 

\ 

-_ 

I+ 

- 


According to the rule, these vectors must be orthogonal. If V"") is orthogonal to vi 
with a = —1, we find that a = +i. Either choice for the sign of a does not lead 
us to a representation of C4 that we have not seen before. But if instead we pick 
gi"! = +1, we obtain a new irreducible representation with a = — 1. Now our set of 
irreducible representations is complete. They are summarized in Table 15.25. 
According to rule #4, the magnitudes of V'", V7, V"!, and V™ should all be 
equal to 4. Taking the dot product of each vector with itself, remembering to take the 


complex conjugate of one of the terms, shows this is indeed true. 


Example 15.6 The same procedure used above can be repeated for the D3 group. 
This group is of order six, and has three classes. We have already identified two 
inequivalent irreducible representations: pi! and DP), That leaves a third, the a 
representation, for us to find. 

Using rule #2, the dimension of py is given by 


(nfl? + (nl2l? + (n8))? = 6 (15.46) 
1+)? +4 =6 (15.47) 
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TABLE 15.26. The Form of the Third Irreducible Representation for D3 


or 
(nl) = 1. (15.48) 


So the irreducible representation we seek is a set of six 1 X 1 matrices. According to 
the multiplication table for D3, the g4, gs and g¢ elements are all their own inverses, 
and are also in the same class. This lets us write gi! = g?) = gi] = +1. Also 
according to the multiplication table, 92 g4 = gs and g3 84 = &6, So we must have 
gil = gi = +1, These properties give us the partially specified representation in 
Table 15.26. If we take the positive sign for all these terms, we just get a repeat of 
the trivial representation D!!!. Therefore we take the negative signs instead, and the 
complete set of irreducible representations for D3 are listed in Table 15.27. 

It is educational to check that the other group rules hold for these representations. 
Rule #3 specifies that we can form “vectors” from the elements of the matrices, and 
that these vectors should all be orthogonal to one another. There will be six of these 
vectors. The 1 X 1 D!!) representation forms the first vector: 


1 
1 
1 
1 (15.49) 
1 
1 
The second vector comes from the pel representation: 
TABLE 15.27. The Irreducible Representations for D3 
I &2 &3 84 85 86 
2 ae | 1 1 1 1 1 
Det 1 1 -1 = -1 
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1 
1 
] 
a (15.50) 
-1 
-1 
The 2 X 2 D3! representation generates the last four: 
l 0 0 1 
—1/2} | -~v3/2} | v3/2 | |-1/2 
-1/2) | V3/2 | |-v3/2{ |-1/2 eee 


1/2 —V/3/2} | -/3/2] | -1/2 
1/2 4/372 /3/2 = 
0 0 


Taking the dot product of all fifteen possible pairs of these vectors shows they are 
indeed orthogonal. According to rule #4, the vectors which come from pi and pel 
should have magnitude squared values of 6 and those from Dev! should have 3. This 
is also obviously true. 


15.4.5 Character Tables and Character Orthogonality 


Once all the irreducible representations are known for a particular group, it is useful 
to construct a character table. This table lists the character of each group element for 
each of the irreducible representations. A character table for the D3 group is shown 
in Table 15.28. 

Similar to the orthogonality rules presented in the previous section, the character 
tables have orthogonality rules of their own. In this case, the “vectors” formed by 
the columns of these tables are orthogonal to one another, while their magnitudes 
squared are equal to the order of the group. It is easy to check that these rules hold 
for the D3 character table. 


TABLE 15.28. A Character Table for the Irreducible 
Representations of D; 
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TABLE 15.29. A More Compact Character Table for D3 


§2, 83 
84,85, 86 


Since elements in the same class always have the same character, the character 
table is often written more concisely by putting all the elements in the same class in a 
single row. The compact character table for D3 is given in Table 15.29. When forming 
dot products between the columns of these character tables, you need to weight each 
term by the number of elements in each class. 


15.4.6 Completeness of the Irreducible Representations 


The real power of the irreducible representations stems from the fact that, using 
the proper similarity transformation, any matrix representation can be written as a 
direct sum of one or more of the irreducible representations. A particular irreducible 
representation may appear once, more than once, or not at all. In other words, with 
some [7], any group representations G!1, can be written 


(TY G47] =GU@GUIe-:--. (15.52) 


This is often called the decomposition of a representation. 

There is a remarkable result of group theory which says that the number of times a 
particular irreducible representation appears in the direct sum of Equation 15.52 can 
be determined just by looking at the character table. Take the vector formed by the 
characters of the representation you want to decompose. Then form the dot product 
of this vector with one formed from the character values of a particular irreducible 
representation. Divide the value of this dot product by the order of the group, and the 
result will be equal to the number of times the particular irreducible representation 
appears in the decomposition. Notice how the irreducible representations are playing 
a role analogous to that of the basis vectors of a coordinate system. Because all matrix 
representations can be expanded this way, the irreducible representations are said to 
be complete. 


Example 15.7 In a previous example, we determined that the 3 x 3 Di‘) matrix 
representation in Table 15.21 could be transformed and written as the direct sum 
DN >) De, We arrived at the result using the diagonalization process and a little 
guesswork. Let’s see how the decomposition rule described above gives us this result 
with much less work. The character vector for the Table 15.21 representation for pi) 
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(15.53) 


} 
He eK COW 


Now take the dot product of this vector with each of the three vectors of the D3 
character table listed in Table 15.28, and divide by 6. The net result is that there is one 
pit representation, one pe representation, and no DY) representation in the block 
diagonalization. This is exactly the result we found before. 


15.4.7. Shur’s Lemma 


Many practical applications of matrix representations center around an important 
group theory result known as Shur’s lemma. Imagine we have a group G, and G"! is 
one of its matrix representations: 


Gh) = {gh, ght), ts ght. (15.54) 


Now suppose we have a matrix [V] which is invariant under similarity transformations 
by all these matrix elements of G”!, That is, 


gl "vg = [V] fori = 1,2,+°+h. (15.55) 


Shur’s lemma says if [V] satisfies Equation 15.55, and G™) is an irreducible repre- 
sentation, then [V} must be a multiple of the identity matrix. That is, 


[V} = Ali], (15.56) 


where A is some constant. If [V] satisfies the conditions of Equation 15.55, it is said 
to commute with the elements of G"!, This terminology is used because an equivalent 
way of writing Equation 15.55 is 


[V]g!) = 2 EV}. (15.57) 


If G™! is reducible, we cannot use Shur’s lemma directly. But remember from our 
previous discussion, any reducible representation can be decomposed into the direct 
sum of two or more irreducible representations by using the appropriate coordinate 
transformation. For example, imagine the decomposition of G®! looks like 


G4 9 oO 
(Tl 'G" |r} =| 0 G2 o |, (15.58) 
0 0 GS) 
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where GUI, Gl), and G!) are irreducible representations of dimension 1, 2, and 4 
respectively, and [T']] is the transformation matrix that puts the matrices into the block 
diagonal form. If [V) commutes with all the elements of G!, the transformed matrix 
[V'] = [T]“![V][T) will commute with all the elements of the transformed G™! 
representation. Then, by Shur’s lemma, the [V]’ matrix must be diagonal, with the 
form 


(V'] = (71 VT] = (15.59) 


oooocoo,r 
oeoooor°o 
ecoooroceo 
ocoor,oceo 
oorococo 
CoToooo°o 

cooooceo 


There are three different constants along the diagonal of {V’], a different one for 
each block of the decomposition of G”!, This works because Shur’s lemma applies 
separately to each irreducible block. 


Example 15.8 We now present an example that shows how many of the ideas of 
group theory can be applied to a problem from classical mechanics. Consider the 
two-dimensional system of balls and springs shown in Figure 15.9. Each of the three 
balls has mass m, and the three springs have the same spring constant k. What are the 
normal frequencies of vibration for this system? Because of the symmetry, we should 
expect group theory to be useful in solving this problem. We begin with a review 
of the standard method of determining the normal modes of such a system and then 
show how group theory simplifies the process. 

Since there are three masses and we are working in two dimensions, we need 
a total of six coordinates to describe an arbitrary state of the system. We will use 
the coordinates q; and q2 to describe the displacement of the mass in the lower-left 


Figure 15.9 Coupled Mass Spring System 
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Figure 15.10 Coordinates for the Mass Spring Problem 


corner relative to its equilibrium position, as shown in Figure 15.10. Likewise, there 
is one pair of coordinates for each of the other two masses shown in the figure. If we 
assume the distance between the masses is much greater than the magnitude of the 
displacements, the potential energy stored in the springs can be written as 

(qa — a) ; 

V= st + exc — q3) + Bae = aa)| : (15.60) 
2 

+ [3a — qs) + Bq = 9s)| 
It is convenient to arrange the potential energy of Equation 15.60 into the form of a 
matrix [V], which is defined so that 


1 
V= 5kVija19), (15.61) 


where there is an implicit sum over both i and j. A bit of algebraic manipulation of 
Equation 15.60 gives the [V] matrix as 


5/4 3/4 1 0 ~1/4 -4/3/4 

V3/4 3/4 0 0 6-3/4 -3/4 

iM = 1 0 5/4 -V/3/4 -1/4 3/4 
0 0 —/3/4 3/4 /3/4 3/4 


1/4 ~—/3/4 -1/4 3/4 1/2 0 


—VJ3/4 -3/4 3/4 3/4 0 3/2 
(15.62) 


To determine the equations of motion, we can use Newton’s second law to write 


dq; ov 
m dpe = agi =i KVijqj- (15.63) 
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A normal mode is a state of motion that oscillates with constant frequency. To find 
the normal modes, we therefore require all the coordinates oscillate at the same 
frequency, 1.e., 9; — ge. Applying this to Equation 15.63 gives 


= Vii ;- (15.64) 


But notice this is the same as a typical eigenvalue problem, 
Vid, = AG (15.65) 


where the eigenvalues are related to the normal frequencies by 
A=—. (15.66) 


This means that to find the normal frequencies we just need to diagonalize the [V] 
matrix. Unfortunately, diagonalizing a typical six-dimensional matrix is no easy task, 
since it involves finding the roots of a sixth-order polynomial equation. But the 
underlying symmetry of this problem, plus some physical reasoning, will let us find 
these eigenvalues much more easily. 

The physical system clearly has a D3-type symmetry. The representation of D3 
we need to consider for this problem is the set of 6 X 6 transformation matrices that 
correspond to each symmetry operation acting on the g; coordinates. Let’s work out 
the g> element of this matrix representation in detail. Looking back at Table 15.10, 
this element corresponds to a rotation of 27/3 counterclockwise around a z-axis 
sticking out of the page of Figure 15.10. Consequently, this operation rotates the 
qi coordinates into the g/ coordinates as shown in Figure 15.11. We can write the 


/ 


/ R,QnI3) 
yf \ 


q 9, 


Figure 15.11 Coordinate Transformation for R,(277/3) Element 
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relationships between the primed and unprimed coordinates as 


lid. Asa Voi-! 
qQ2 


1 = 545 “> 46 = 545 7 5% 
Oe ee aes Lie mee 
3 511 2 q2 q4 2 q a2 
Ins 373 We eo! 
5 = — 593 7 “5 U5 qo = me iz 7d (15.67) 


so the matrix which transforms the primed coordinates into the unprimed coordinates 
is 


0 0 0 0 -1/2 -V3/2 
0 0 0 0 “f3/2. -=1/2 
-1/2 -/3/2 0 0 0 0 
ROHN Tia tig 0 0 0 
0 0 ~1/2 -V/3/2 0 0 
0 0 3/2 -1/2 0 0 
(15.68) 


Repeating this process for the other D, elements generates the complete representa- 
tion, given in Table 15.30. In addition, this table lists the character of each matrix. 
These matrices are reducible, and we can use the completeness and orthogonality 
of the character tables to determine how many times each irreducible representation 
of D3 occurs. Looking at Table 15.30, the character vector for this representation is 


(15.69) 


oooooca 


If we form the dot product with the columns of the irreducible character table for D 
given in Table 15.28, we find the decomposition for the 6 X 6 representation is: 


DI! @ D?! @ DB! @ DE. (15.70) 


Because the potential matrix must obey the underlying symmetry of the system, 
it must be invariant to similarity transformations performed by all of the elements of 
the 6 X 6 representation for D3. By Shur’s lemma, the transformation matrix [T] that 


TABLE 15.30. 6 x 6 Matrix Representation and Character Values for the Spring 
Problem 


0 Ape V3 
0 4/3/2. =1/2 


~1/2 Vf3/2 

~/3/2 1/2 0 
0 0 -1/2 +/3/2 
0 —/3/2 =1/2 
-1/2 3/20 0 0 
=4/3/2 1/2 0 0 


R,(41/3) > 


0 0 
0 0 
R,(22/3) > " 2 " 2 : : 


0 0 0 1/2 -/3/2 

0 0 0 =/3/2 -1/2 
0 1/72, ~=/3/2 0 0 
0 0 -/3/2 -1/2 0 0 
1/2 -/3/2 0 0 0 0 
—/3/2 1/2 0 0 0 0 


1/2 /3/2 0 0 
/3/2 $172 0 
| 0 0 1/2 3/2 
a 0 0 V3/2 -1/2 
0 0 
0 0 
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block diagonalizes the 6 X 6 representation will place the [V] matrix in the form 


[V'] = (T]“[V][T] = (15.71) 


cooocoo,r 
ooooro 
coor7v oo 
oor7voo°o 
eorooce 
PpPoocooo 


where we have used [V’] to represent the [V] matrix in this diagonalizing coordinate 
system. Already group theory has reduced the number of unknown eigenvalues of the 
{V] matrix, and therefore the number of normal mode frequencies for our mass spring 
problem, from six to four. Notice that this has been accomplished without actually 
finding the elements of the [J] matrix. 

The next simplification comes not from group theory, but rather from some general 
physical reasoning. When finding the normal modes of any unconstrained system, 
there will always be a number of modes with zero frequency. These correspond to 
rigid displacements of the system. In a two-dimensional system, such as this example, 
there must be exactly three such modes: two translational and one rotational. We can 
pick A, and the two A3’s to be the eigenvalues associated with these modes, which 
simplifies the [V’] matrix to 


[Vv] = (15.72) 


coooo°o 
ooooro 
ooooo$o 
coocoooo 
orpoococo 
rPooooce 


Now the problem has only two unknowns. 

We can determine the values of the remaining eigenvalues by remembering that 
the trace of a matrix is invariant to similarity transformations. Therefore, the trace of 
the original [V] matrix must equal the trace of [V’): 


Tr{V] = Tr[V’] 
6 = Az + 2Agq. (15.73) 


There is only one free parameter left, and we need one more condition to specify 
things completely. The simplest way to obtain it is to consider the squares of the 
potential matrices: [V]? and [V’]*. The trace of these two matrices must also be 
unaffected by similarity transformations, so we can write 


Tr ({(V)’) = Tr (IV'l) 


2 = Ap? + 2A4?. (15.74) 
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The simultaneous solution of Equations 15.73 and 15.74 gives the results A. = 3 and 


My = 3/2. 
Combining these results with Equation 15.66, we finally have the normal frequen- 


cies: 
wo, = 0 w= eye Py ct ee (15.75) 
2Vm m 


The first two frequencies are degenerate, since w, occurs three times and w, twice. 


15.5 CONTINUOUS GROUPS 


So far, all the groups we have discussed have had a finite number of discrete elements. 
There is another type of a group that has an infinite number of elements, described as 
a function of one or more continuous variables. This section briefly discusses three 
examples of this type of group. 


15.5.1 The 2D Rotation Group 


The C, group consisted of four elements that could be viewed physically as the 
rotations in a plane which left a square in a state indistinguishable from its initial 
state. We mentioned that this group could be generalized to C,, to handle the symmetry 
of a polygon with n sides. 

What happens when we let x — 2? Now we really have an infinite number of 
elements which correspond to any rotation angle in the plane. If we use the “R” 
notation for rotations, the group elements can be written as continuous function of 
the angle, which we call 0, 


lim. C, — R(8), (15.76) 


with 0 = @ < 27. Let’s check to be sure that this definition of the group elements 
satisfies the requirements of a group: 


1. The two rotations applied in succession, R(6,)R(62), is equivalent to R(@, + 62), 
which is also an element of the group. Keep in mind that if 6, + @ falls outside 
the range of 0 to 27, it is necessary to subtract an integer multiple of 27 to put 
the result back into the 0 to 27 range. 

2. The multiplication is associative, because [R(01)R(@2)]R(@3) = R(O, + 02 + 
63) = R(O,)[R(@2)R(83)I. 

3. There is an identity element, J = R(O). 

4. Each element R(@) has an inverse R(27 — 6). 
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We can also generate some matrix representations for these elements. An obvious 
one-dimensional choice is the complex exponentials: 


R(@) > e”9, (15.77) 


A different, two-dimensional representation is the set of matrices 


(15.78) 


R(6) > cos 8 o, 


—sin@ cos@ 


It is a worthwhile exercise to convince yourself that both these representations obey 
the underlying group multiplication table. 


15.5.2. The 3D Rotation Group OF 


The set of all possible rotations in three dimensions is another example of a continuous 
group. It is easy to show that all four group axioms are still satisfied when we move 
into three dimensions. 

The matrix form for the continuous two-dimensional rotation group, given by 
Equation 15.78, can be generalized to three dimensions. If we restrict the rotation 
to be around the z-axis, as shown in Figure 15.12, the matrix representation of the 
rotation is given by the 3 X 3 matrix 


cos@ sin@ O 
RA(@)— } —sin@ cosé 0 0=6<27. (15.79) 
0 0 1 


An arbitrary orientation in three dimensions can be accomplished using successive 
rotations around any three different axes. There are many ways to accomplish this, 
but the conventional choice is the rotation sequence 


Rib, 6, b) = Re(W)Ry (ORO). (15.80) 
Here the x'-axis is what the x-axis has become after the first rotation, and the z-axis 


is what the z-axis has become after the second rotation. It is useful to also describe 
this process in words. First, rotate around the z-axis by an angle @. This changes the 


Figure 15.12 Rotation Only About the z-Axis 
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Figure 15.13 Euler’s Angles 


x-axis into a new axis, x’, but leaves the z-axis alone. Next, rotate around this x'-axis 
by an amount 6. This transforms the z’-axis into the z-axis. Finally, rotate around 
the z-axis with angle y. The three angles @, 6, and & are called the Euler angles 
and are shown pictorially in Figure 15.13. 

We can write down the representations of each of the individual rotations that 
make up Equation 15.80. The rotation about z is the matrix 


cos@ sind O 
Rid) | -—snd cosd 0}. (15.81) 
0 0 1 


The rotation around x’ in terms of the primed coordinates is given by 


1 0 0 
RO) —> |0 cosd@ sm@!. (15.82) 
0 —sin@ cos@é 
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The final rotation around z” in terms of the double primed coordinates is 


cosy sing O 
Rah) | —sing cosp 0]. (15.83) 
0 0 1 


Multiplication of these three matrices gives the intimidating 3 X 3 matrix: 


~cos@sinwsin@ : cos@sinwcos@ : sin@siny 
+ cos d@ cos w : + sind cos 
Roy — cos @cos & sin d cos@coswcosd@ : sin @cos yw 
— sin yy cos d : —sinwsind 
sin 0 sind ; — sin @cos @ : cos 6 
(15.84) 


Because this matrix represents a rigid rotation of coordinates, it is necessarily 
orthonormal. It also has a determinant of +1. In fact, if w, 0, and @ are allowed 
to roam over their entire 0 to 277 ranges, we have ail the possible 3 X 3 orthogonal 
matrices with + 1 determinants. For this reason, when the group of three-dimensional 
rotations is written in this manner, it is called Oj , where the O stands for orthogonal, 
the 3 for the dimension, and the + to indicate the sign of the determinant. In the 
literature, it is also frequently called SO(3), where the S$ stands for the “special” 
condition of a +1 determinant. You also might run into the term unimodular to 
describe matrices with +1 determinants. 


15.5.3 The Group of Special Unitary Matrices SU(2) 


Our last example of a continuous group is the complete set of 2 < 2 unitary matrices, 
with determinants of +1. This group is commonly called SU(2). A matrix is unitary 
if its inverse is equal to the complex conjugate of its transpose. The general form of 
a2 X 2 matrix that satisfies these conditions is 


ab 
| 4: ae (15.85) 


with the requirement that a a* + bb* = 1 to ensure the unimodular condition. Notice 
there are really three free parameters here, because the real and complex parts of a 
and b provide two each, but the unimodular condition reduces this number by one. 
To show that these matrices form a group, we need to prove that the multiplication 
of any two of them still has the general form of Equation 15.85. This can be verified 
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by forming the product 


Gh. Bi a aa, — bb; a,b, + bay 
Ex [4% z| | ey ats —Bib, + aig] CPB) 


Now define a, = a,a, — b,b3 and b, = a,b, + b,a5, and notice the right-hand side 
of Equation 15.86 can now be written as 


= | (15.87) 
Ss a 
The unimodular condition still holds: 
4303 + bb; = (4a — b, b> ajay me bib>) + @b, + b,a3)(ajb> + bia) 
= @)G}a,a) + ayaibob; + by bibyb> + by bia,a 
= (a\a; as by bi )(aa; + b,b3) 
= J. (15.88) 


This group can be generalized to SU(n), the set of n X n unimodular, unitary 
matrices. These matrices can be shown to have (n? — 1) independent parameters. 


EXERCISES FOR CHAPTER 15 
1. Let S be a subgroup of a larger group G. If g; is an element in G that is not in S, 
show that g; ' is also not in S. 
2. Show that there are only two distinct fourth-order groups. 
3. How many classes does the vierergruppe have? 


4. Identify all the subgroups of D3. For each subgroup, indicate the elements of all 
its possible left and right cosets. Show that the ratio h/h’ is an integer for all of 
the subgroups, where h is the order of D3 and h’ is the order of the subgroup. 


5. Three elements of a group are represented by the following 2 X 2 matrices, 
1 0 w O 0 1 
0 1 0 w 1 0; 
where w = e 


(a) What are the other elements of this group? 


i2a/3 


(b) For each element identify its inverse. 
(c) Find all the subgroups and identify their elements. 
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6. In Table 15.21, we presented a 3 X 3 matrix representation of the D3 group. 


(a) Find the transformation matrix that puts this representation in block diagonal 
form. 


(b) What is the inverse of this transformation matrix? 


(c) Show that the character of the matrices in the block diagonal form are the 
same as the original matrices. 


7. Consider the twofold symmetric molecule composed of three pairs of atoms 
shown below. In the chapter, we developed the D2 group to represent the sym- 
metry of this object. The four elements of this group satisfy the vierergruppe 
multiplication table given in Table 15.7. 


The symmetry of this molecule allows the identification of a group of larger 
order. For example, the interchange of only the two largest atoms will also leave 
the structure unchanged and is not represented by an element of D2. The D2 
group describes only rigid body transformations. Interchanging the two largest 
atoms while leaving the others alone is not a rigid body transformation. 

(a) Identify all the elements of this higher-order group and determine its multi- 
plication table. 

(b) Show that D2 is a subgroup of this group. 

(c) Develop a3 X 3 matrix representation for this group. 

(d) Using the [123456] operator representation, identify the elements of this 
higher-order group. 

(e) Verify that h/h’ is an integer, where h is the order of the larger group and h’ 
is the order of D>. 

(f) Determine the left cosets of the Dy subgroup. 


8. Consider the symmetry group of the regular tetrahedron shown below. If nonrigid 
body transformations are allowed, this becomes the S4 group. 
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(a) What is the order of S,? 

(b) Obtain a nontrivial 4 < 4 matrix representation for this group. 

(c) Identify the classes of this group and show that the character of each of the 
matrices in part (b) is the same for each class. 

(d) Identify the subgroup of S4 that corresponds to only the rigid rotations of the 
regular tetrahedron. 


9. Construct a 4 X 4 matrix representation for the C4, group. The construction of this 
representation, which we will call ce, can be accomplished using the following 
method. According to the C, multiplication table, g3 g2 = g4. We will require 
that the g* element obey 


0 0 
1 0 
3 lo] = lo 
0 1 
Likewise, g3 g3 = I, so we will also require 
0 1 
0 0 
yi} = lo 
0 0 


(a) Continue this process and determine all four elements of the oa represen- 
tation. 

(b) Determine the characters of the cv representation. Compare them to the 
characters of the irreducible representations of C4. 

(c) The co representation is reducible. Using the completeness and orthog- 
onality of the character table of C4, determine the number of times each 
irreducible representation of C, appears in the reduced form of Cr 

(d) Give the transformation matrix that puts the representation into the reduced 
form in part (c). 


EXERCISES 


637 


10. Consider an object consisting of four identical atoms, as shown below: 


11. 


iy 
c | b 
(-1,D@------ @ (1,1) 
. x 
, x 
Cl-De- ----e(-t 
d a 


Let the D, group consist of the set of rigid body transformations that leave the 

appearance of this object unchanged. 

(a) Identify all the elements of this group using the [1234] operator notation. 

(b) Construct the multiplication table for D4. 

(c) Identify the inverse of each element of the group. 

(d) Identify all the subgroups of D4. 

(e) Without performing any similarity transforms, identify the different classes 
associated with D4. 

(f) How many possible irreducible matrix representations are there for this 
group? 

(g) A2X2 matrix representation can be constructed for this group by associating 
a two-dimensional transformation matrix with each element. Determine ail 
the elements of this 2 X 2 representation. 

If the restriction of rigid body transformations is relaxed, the symmetry of this 

object can be described by the S, symmetry group, which consists of all the 

permutations of four objects. 

(h) Identify all the elements of S4 using [1234] operator notation. 

(i) Show that Dy, is a subgroup of S4. 


Construct a 6 X 6 matrix representation for the D3 group, following the same 
prescription used in Exercise 9. We will call this representation pi, 
(a) Determine the character of the elements of the De! representations and 


include them as a fourth column in the character table for the D3 group, as 
shown below: 


ps) 


3) 


x21 


yi 
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(b) Show that the vectors formed by the character values of the irreducible 
representations are all orthogonal to each other, but not to the vector formed 
by the character values of the py representation. 

(c) Determine how many times each of the irreducible representations appears 
in the decomposition of the pe! representation. 


12. Consider the symmetry group associated with a diatomic molecule. This molecule 
has two identical atoms located in the xy-plane, as shown below: 


a 
Z 
7 


(1-6 
a 


(a) Determine the four elements of the symmetry group, which correspond to 
rigid rotations in the plane and reflections. 

(b) Construct a 2 X 2 matrix representation of this group by associating a trans- 
formation matrix with each element. 

(c) Determine the multiplication table of this group. 

(d) Find the transformation that simultaneously diagonalizes all the elements 
of the representation found for part (b) and list all the diagonalized matrix 
elements. 

(e) The transformation you found in part (d) can be viewed as a coordinate sys- 
tem transformation. Sketch the locations of the atoms in this new coordinate 
system. 


13. In Equation 15.68, we derived a matrix representation for the g2 element of D3. 
Derive the other elements of this representation, which are listed in Table 15.30. 


14. Another way to write the general form of the continuous SU(2) group is 


(15.89) 


e“cosy  e# siny 
—e BPsiny ecosy]’ 


where a, B, and y are all real quantities. Show that this form automatically 
satisfies both the unimodular and unitary conditions. 


A 


THE LEVI-CIVITA IDENTITY 


The three-dimensional Levi-Civita symbol is defined as 


+] for i, j,k = even permutations of 1, 2,3 
€ijx = 4 —1 for i, j,k = odd permutations of 1,2,3. (A.1) 
0 if two or more of the subscripts are equal 


One useful identity associated with this symbol is 
ijk Ersk = Sir Bj5 — Sis5jr- (A.2) 
To prove Equation A.2, start by explicitly writing out the sum over k on the LHS: 
Eijk Ersk = Eiji €rs1 + €ij2€rs2 + €ij3s3- (A.3) 


Now consider €;;; €,s, the first term on the RHS of Equation A.3. This term is zero if 
i, j,r, or S is equal to 1. It is also zero if i = j orr = s. Therefore, €;; €,5; is nonzero 
for only four combinations of the indices (i, j, r,s): (2, 3, 2, 3), (2, 3, 3, 2), (3, 2, 2, 3), 
and (3, 2, 3, 2). Thus the first term on the RHS of Equation A.3 becomes 


+1 for (i, j,r,s) = (2,3, 2, 3) or (3, 2, 3, 2) 
Gjsi = 4 —1 for (i, j,r,5) = (2,3, 3,2) or (3,2,2,3) . (A.4) 
0 otherwise 


The other two terms on the RHS of Equation A.3 behave in a similar fashion, with 
the results: 


+] for (i, j,r,s) = (1,3, 1,3) or G, 1,3, 1) 
€j2€52 = « —1 for (i, j, r,s) = (1,3, 3, 1) or G, 1, 1,3) (A.5) 
0 otherwise 
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and 


+1 for (i, j,7,5) = (1, 2, 1, 2) or (2, 1,2, 1) 
633 = 4 —1 for (i, j,r,5) = (1,2, 2,1) or (2,1,1,2) . 


0 otherwise 


(A.6) 


The LHS of Equation A.2 is the sum of the three terms described above. For a 
given set of values for i, j,r, and s with i # j and r # s, only one of the terms is 


nonzero. Therefore, 


+1 fori# j,r#sizr,j=s 


Eijk Ersk = § —1 fori#j,r#si=sjur. 


0 otherwise 


(A.7) 


But notice, these conditions are also satisfied by the quantity 5;,5;,; ~— 5;;5j,, because 


+] fori#j,r #s,j=s 


5ir 5 js — 5is5 jr = -1 fori# j,r#s,j=ur ‘ 


0 otherwise 
Hence, we have our result: 


€ijnErsk = Sir 8js — Sis djr- 


(A.8) 


(A.9) 


B 


THE CURVILINEAR CURL 


A rigorous derivation of the form of the curl, gradient, and divergence operators in 
curvilinear coordinate systems requires a careful analysis of the scale factors and 
how they change as functions of position. In the text, this was treated in a somewhat 
cavalier manner. In this appendix, we present a more rigorous derivation of the 
curvilinear curl operation. The final result is, of course, the same as presented earlier. 
The rigorous derivation of the gradient and divergence operations follow a similar 
manner. 
The curl operation can be defined by the integral relation 


pao-vxK= fara (B.1) 
S c 

where C is a closed path surrounding the surface S, and the direction of d@ is defined 
by the direction of C and the right-hand rule. If we align the surface perpendicular to 


the 1-direction, as shown in Figure B.1, and take the limit as the surface shrinks to a 
differential area, the LHS becomes 


lim | d@-VXA=q- [V XA] Myh3dqndg;. (B.2) 
— $< 
[V4], 


This term is proportional to the 1-component of the curl. 
The RHS of Equation B.1 in this differential limit is 


lim at-K= [ ar-a+ | ar-A+ | ava + | dt-A,  (B.3) 
C0 Jc C Q on Cs 
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4, 
are 
at 
Figure B.1 Orientation of Surface for Curvilinear Curl Integration 


where the loop has been broken up into four parts, as shown in Figure B.2. The 
contribution from C, is given by 


al 92, +dqn 
i, dt-A = i dqnAzhy. (B.4) 
Cc; 920 


Along C, both Az and hy can vary with g2. Since we are in the differential limit, we 
will consider only the lowest-order, linear variation of the integrand: 


O(Azh2) 
Aah = (Arha)| + 22) gy ~ gy), B.5) 
220 +930 42 920130 
Substituting Equation B.5 into Equation B.4 gives 
eit, h 2 
| dr-A = (Azha)| dqn + (Arh) (qa) (B.6) 
Q 920/930 942 92010 2 
Next the integration along C3 will be performed: 
_ 20 
i d¥-A= | dqnAzh. (B.7) 
fen 920 + dq 
(G20-G30+dq3) a eee (Goot+dq2,q3o+dq3) 
i es 
h3dq, | Cy Cc } C, j 


(20:30) @ ~~ (dao+dq2,q30) 


h,dq, 
Figure B.2 Differential Geometry for Curvilinear Curl Integrations 
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Here the expression for Ajh2 picks up an extra term because q3 = q3, + dq3: 


ie O(Azhz) 
20-430 0q2 


" O(Azh2) 


Azhy = (Azh2) 39 
3 


dq;. (B8) 


920430 


(42 — 420) 
920>30 


Substituting Equation B.8 into Equation B.7 gives for the line integral along C3, 


(dq) 


=: O(Agh 
| dt- A = —(Azh) dq2 (Azh2) 
C3 920-930 0q2 


_ WAzh?) 
043 


920+930 


dq2dq3. (B.9) 


20:90 


Similar integrations can be performed along C2 and Cy. When summed, these 
pieces give the value for the closed line integral: 


O(h3A3) — A(hy Az) 
442 043 


Notice how the higher-order terms have al} canceled. Substituting Equations B.10 
and B.2 into Equation B.1 gives for the 1-component of the curl of A, 


a ete bene _ eed 
942 43 


lim ar-X= | 


lim p dqndqs. (B.10) 


(B.11) 


The other components of V x A can be obtained by reorientating the surface shown 
in Figure B.1 to be along the other curvilinear axes: 


1 [a A1) — 004343) 
VXxAlL= B.12 
“Al, hyhy 0q3 aq, | 
1 [d(rzA2) — ACA Ay) 
VxAl,= - ; B.13 
Al; hyhy | oq 0q2 | : 


Equations B.11—B.13 can be written compactly using a determinant: 


hd: hd, yds 
d/dq, 38/dq, A/dq3\ . (B.14) 
hyAy hhypA2 3 Ag 


hyhyhs 
det 
or even more concisely using the Levi-Civita symbol and subscript/summation nota- 
tion: 


Gi OC Ay) 


Vv x A = €j; 
SUR A hg 0qj 


(B.15) 
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THE DOUBLE INTEGRAL IDENTITY 


In this appendix, the double integral identity 


| ax" | dx' f(x') -| dx! (x — x') f(x’) (C.1) 
Ja a a 
is proven. To derive this relation, make the substitution 
dy({x 
fe) = 22 (C2) 


Equation C.1 becomes 


ie axt f° dx 2) = fae "x i2@) (C.3) 


The first integration on the LHS, over the variable x’, can be done directly. The 
integral on the RHS can be rewritten using integration by parts, with u = (x — x’) 
and dv = (dy/dx')dx', to give 


| dx"ty(x") — ya) = (x — x(x) 


a 


* +f dx! y(x'). (C.4) 


When integration of y(a) on the LHS is performed, and the first term on the RHS is 
evaluated at the limits, the result is 


(a — x)y(a) + - dx" y(x") = (a — x)y(a) + ie dx’ y(x'). (C.5) 


Since x’ and x” are dummy variables of integration, the two sides of this equation are 
obviously equal, and the identity of Equation C.1 is proven. 
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D 


GREEN’S FUNCTION SOLUTIONS 


In Chapter 10, the Green’s function solution for a linear, nonhomogeneous, second- 
order equation was obtained using a superposition argument. We first solved the 
special case of a 6-function drive for the nonhomogeneous term. Then we argued 
that the general solution, with an arbitrary nonhomogeneous term, could be obtained 
by simply forming an infinite weighted sum of these 5-function solutions. This proof 
relied on a certain amount of intuition. A formal proof of this solution can be obtained 
by directly inserting the Green’s function solution into the differential equation, and 
showing that the nonhomogeneous differential equation and the boundary conditions 
are Satisfied. 

Consider the general linear, nonhomogeneous, second-order differential equation: 


dy(x) 
ax 


d 

ae [Po | — Ox)y(a) = A(x). (D.1) 
This equation is in Sturm-Liouville form, but keep in mind any linear, second- 
order equation can be put into this form with a little manipulation. We want to find 
the function y(x) which solves Equation D.1 and satisfies the set of homogeneous 
boundary conditions: 


aya) + a2) = 9 (D2) 
d 
By y(b) + Bo xe =0. (D.3) 
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We will prove that the integral 


b 
soz i dé h(é)e(xlé) (D.4) 


satisfies these conditions, where g(x|/) is a function that satisfies the differential 
equation 


d 
& (Ps a) ~ O(w)galé) = 8 - &) (D5) 
and the same set of homogeneous boundary conditions: 
ayg(al) + a2-—— ane) 90 (D.6) 
d 
Big(blé) + B,-~-— gee) =0. (D.7) 


We begin by describing the properties of the Green’s function g(xlé). Notice 
everywhere, except at x = &, g(x|é) satisfies the homogeneous differential equation 


5 |P (x dete) — Q(x)g(xlé) = 0. (D.8) 


If g;(x|&) and go(xl€) are the solutions to this homogeneous equation on either side 
of x = &, we can write 


fal)  asxse¢ 
ana (ole) i ics 


Since Equation D.8 is a second-order equation, its general solution will contain two 
arbitrary constants. But these constants are not necessarily the same on the two sides 
of x = &, so there really are a total of four unknown constants associated with Equa- 
tion D.9, two for g1(x|é) and two for g2(x/é). Therefore, we need four independent 
equations to completely specify g(xlé). Two come directly from the boundary con- 
ditions. Using g,(x|&) in Equation D.6 establishes one equation. Equation D.7 with 
82(x|€) establishes the second. A third equation comes from requiring that g(xl&) be 
continuous at x = &: 


gi(élé) = go(élé). (D.10) 


This continuity is guaranteed by the form of Equation D.5. If g(«|é) were discontin- 
uous at any point, the second derivative operation on the LHS of Equation D.5 would 
generate a term proportional to the derivative of a 6-function, a term that obviously 
does not exist on the RHS of Equation D.5. The last equation comes from integration 


GREEN’S FUNCTION SOLUTIONS 649 


of Equation D.5 across the 6-function: 
Ete d x éte 
| dx {2 [Pes ) at a) = oes} = i; dx &(x— €). (D.11) 
—e é-€ 
For arbitrarily small € this becomes 


=EteE ' 
— O€)g(€lé)2€ = 1, (D.12) 


x=€-€ 


P( py f2t8)| 


where we have assumed that Q(x) and P(x) are well-behaved functions, so near 
x = €, Q(x) — Q() and P(x) — P(&). As € — 0, Equation D.12 becomes 


dgo(xlé) 
dx 


_ dgi(xl€) 
dx 


xe 


1 
——~. D.13 
PO ite 


x=€ 


The 6-function on the RHS of Equation D.5 causes a discontinuity in the derivative 
of g(xlé) at x = &. 

Next we show that Equation D.4 is a solution to the original differential equation, 
by substituting it into Equation D.1. In terms of g,(x|é) and g2(x|€), this solution is 


x b 

yx) = [dg hé)gate) + [ae n@eate, (D.14) 
a x 

Before putting this in the LHS of Equation D.1, we need to evaluate its first and 

second derivatives. This is a bit tricky, because the derivatives are taken with respect 

to a variable that is in both the integrand and the limits of integration. To evaluate 

such a derivative, use the result 


d fe BO) §C(x, d 
=| acy = / dy SEY) 4 SO C6, oy - Cex, ata. 
X Ja(x) a(x) Ox 
(D.15) 
Therefore, 


BO fae nel + [ae me) EO + anedals) ~ Monertebo. 


(D.16) 


The last two terms in this expression cancel because the Green’s function is continu- 
ous, and we have 


oo) ak dt ng eel 5 +f dé gil) (D.17) 


650 GREEN’S FUNCTION SOLUTIONS 


The second derivative of y(x) is 


a9) = [deme B® *ga(xlé) | + fag ne EGO “aiG§) 


dgi(xl€) 
dx 


de20ah6) (D.18) 


+ h)———— | — he) 


x=E 


x= 


The last two terms of this expression do not cancel because of the discontinuity in 
the derivative of the Green’s function we discovered earlier. Using Equation D.13, 
however, these terms can be combined to give 


: d 
290) = is dé he) 8218) E218) +f dé ng S18) ais Fa we @.19) 


Substituting these expressions for y(x) and its derivatives into the LHS of Equa- 
tion D.1 gives 


P(x) 


dP(x) dentl6) —— 
+ ol fan [ene ef den gy B18) st 


x b 
200] [ dé miéreatsle) + [ dé Mento) (D.20) 


Both 2)(x[&) and g2(x{€) are solutions to the homogeneous Equation D.8. Because 
of this, all the terms in Equation D.20 cancel, except the single term A(x). Our final 
tesult is 


b 
{< ere 5] i a} / d& h(E)g(xl€) = h(x), (D.21) 


which shows that the Green’s function solution for y(x) indeed satisfies the original 
nonhomogeneous differential equation. 

It also easy to show that the Green’s function solution satisfies the original ho- 
mogeneous boundary conditions. The boundary condition at x = a is given by 
Equation D.2, and using Equation D.14 and D.17 evaluated at x = a, we can write 


‘ d 
ayy(a) + a oe = a | d& h(€)gi(alé) + af dé n(gy 2818) ace) 
b 
> | dé h(é) [eral + an 1G) ; | 


=0. (D.22) 
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Since we constructed the Green’s function to obey the same boundary condition, 
as given by Equation D.6, the last line in Equation D.22 is zero. A similar line of 
reasoning shows that the boundary condition at x = b, given by Equation D.7, also 
works. 
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E 


PSEUDOVECTORS AND 
THE MIRROR TEST 


The mirror test provides an intuitive way to identify a pseudovector. A “regular” 
polar vector will reflect, as you would expect, in a mirror but a pseudovector will be 


reversed. 


An example of this test, taken from electromagnetism, is described in Figure E.1. 
The real system on the left is a positively charged particle moving in a circular 
orbit, counterclockwise when viewed from above. This motion results in the average 


Real object 


oa 


yo 


Mirro, 


Figure E.1_ The Mirror Test for Pseudovectors 


i 


1B 


Mirror image 
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magnetic field pointing “up,” as indicated by the B vector. At a particular instant of 
time, this particle is at a position indicated by F and moving at a velocity v. Now view 
the image of this motion in the mirror shown in Figure E.1. The image particle moves 
in a clockwise, circular orbit. This motion is properly described by the images of the 
position and velocity vectors. Hence, they are regular vectors. The reflected image of 
the magnetic field vector is still in the up direction. But notice, if a positively charged 
particle were really performing the motion of this image particle, the magnetic field 
would point down, as indicated by the dashed B vector. The magnetic field vector 
fails the mirror test, and is therefore a pseudovector. 


F 


CHRISTOFFEL SYMBOLS AND 
COVARIANT DERIVATIVES 


Non-orthonormal coordinate systems become more complicated if the basis vectors 
are position dependent. An example of a two-dimensional coordinate system of this 
type is shown in Figure F.1. In this system, the displacement vector can still be written 
as 


d¥ = dx! 8 + dx? g. (F.1) 


Since 


OT OTF 
dt = SS I ow 2 F2 
F (Sa) a + (=) dx‘, (F.2) 


= —, (F.3) 


but now are functions of position. The effect of nonconstant basis vectors is most 
evident when applying derivatives to vector and scalar fields. 

In Chapter 14, the gradient operation was discussed for skewed coordinate systems, 
where the basis vectors were not orthonormal and had constant basis vectors. The 
gradient was defined as 


as (F.4) 
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8,4 dx°B 


A 
\ 


Figure F.1 A Coordinate System in Curved Space 


and the del operator itself was identified as 


wana? 
V=¢é oat (F.5) 
The derivative d¢/dx', is the i covariant component of the gradient vector. Notice 
how the contravariant basis vector ¢! is not differentiated. This is important, because 
when we move to systems where the basis vectors are no longer constants, the gradient 
operation does not need to be modified in any way. 

Things are very different for the divergence of a vector field. Using the same 
definition for the del operator, the divergence of a vector V = V'8; becomes 


V= 85 - (V'B:) . (F6) 


Now the spatial derivative operates both on the vector components V' and the §; basis 
vectors. Rearranging Equation F.6 gives 


V-V=28'- — (Vig). (F.7) 
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If the basis vectors are not constants, the RHS of Equation F.7 generates two terms 


Se w_aj avi , ( 08 
V-V=2#- ——- 6+ Vi { — Fi F. 
[Gere Go) i 


The last term in Equation F.8 is usually defined in terms of the Christoffel symbol 
ré.: 
ij 


se = Th &. (F.9) 
The definition in Equation F.9 implies the result of the differentiation on the LHS 
must be a vector quantity, expressed in terms of the covariant basis vectors £,. 
The contravariant components of the vector quantity are given by the Christoffel 
symbol with a superscripted k. These components obviously also depend on which 
basis vector is being differentiated, given by the i index, and which coordinate the 
differentiation is being taken with respect to, given by the j index. Because of 
the nature of the LHS of Equation F.9, these indices are written as subscripts, so 
that the Christoffel symbol in Equation F.9 obeys the normal superscript/subscript 
conventions. It is important to note, however, the Christoffel symbol is not a tensor. 
Its elements do not transform like the elements of a tensor. 
In order to complete our discussion of the divergence, we must evaluate the 
Christoffel symbols in terms of the coordinate system geometry. Equation F.9 can be 
solved for I; by dot multiplying both sides by é': 


4. abi , 
a = The &). (F.10) 
or 
es 
mj =e rai (F.11) 


The basis vectors can still be written in terms of derivatives of the position vector, 


or 
6, = —, F.12 
i=, (F.12) 
so the Christoffel symbol becomes 
or 
i=? -—. F.13 
Vy ax) ax! oe 


This equation clearly indicates that the Christoffel symbol has a symmetry with 
respect to the subscripted indices 


r=, (F.14) 


Equation F.13 provides a method for evaluating the Christoffel symbol, but it is 
not very useful because it is in terms of the derivatives of the position vector and 
the basis vectors of the coordinate system. A more useful relation can be derived by 
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considering the metric. Remember the metric for a coordinate system is 
Mi; = 8° 8). (F.15) 


Even though the Christoffel symbol is not a tensor, this metric can be used to define 
a new set of quantities: 


Taj = Mul; (F.16) 
This quantity, I,;;, is often called a Christoffel symbol of the first kind, while [* j 
is a Christoffel symbol of the second kind. Notice the Christoffel symbol of the first 
kind exhibits the same symmetry with respect to the last two subscripts: 


Puy = Vp. (F.17) 


Combining Equations F.11 and F.16 gives 


Ty = Mub! - 76 
Oo o. 
= &- =. (F.18) 
The spatial derivative of the metric, 
at = §;- (#) + (%) > B), (F.19) 
can now be written in terms of Christoffel symbols as 
oe =e ae (F.20) 
Similarly, 
ae =Ty;+Tuy (£21) 
and 
ste = Djg + Taj. (F.22) 


Adding Equation F.22 to F.21 and subtracting F.20, while making use of the symmetry 
of the Christoffel symbols, gives 


(F.23) 


oxs oxi ax* 


Tepes 


1 é; * OM jx _ =) 
5) ‘ 
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Raising the first index with the metric gives an expression for the Christoffel symbol 
of the second kind in terms of the coordinate system’s metric: 


kL ‘ . = 
Hees (% 4 ee a) (F.24) 


a 2 \ oxi oxi ox! 
This equation allows us to evaluate the Christoffel symbol if we know the metric. 


Returning to the divergence operation, Equation F.8 can now be written using the 
Christoffel symbol as 


=gi- (vig, 
a’: = (V8) 
, [favi. ns, 
ae (Se) +V r%t| (F.25) 


The quantity in brackets on the RHS is referred to as the covariant derivative of a 
vector and can be written a bit more compactly as 


<_ (vig) = (S + vrs) be. (F26) 


where the Christoffel symbol can always be obtained from Equation F.24. If the basis 
vectors are constants, I j = 9, and the covariant derivative simplifies to 


ave 
as you would expect. 
__ The curl operation can be handled in a similar manner. The curl of the vector field 
V is 
VxV= eS 7 X (V'Bi). (F.28) 


which, written in terms of the covariant derivative, is 


She TOV ar 
VxV= (+ vrs) (es xe). (F.29) 
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CALCULUS OF VARIATIONS 


The development of the calculus of variations begins with an integral in the following 
form: 


l= je dx f{x,y(x), dy/da}. (G.1) 


The function in the integrand depends upon the independent variable « and on the 
unknown function y(a) and its derivative dy/dx. Both y(x) and its derivative are 
taken to be functions of the independent variable +. The problem is to determine the 
function y(xz) so as to minimize the value of the integral. An example of such an 


integral is. 
1/2 
xr2 d 2 
r= f de +(2)| (G.2) 
ee dx 


This form looks like that of an integral equation because the unknown function is 
inside an integral operation. But it is unlike an integral equation in that the value of 
the integral is not known and also because the minimization of J does not uniquely 
determine y(z) unless a set of boundary conditions like 


y(zi}) = Nn (G.3) 
y(te) = yp (G.4) 


is specified. Here y, and y2 are known constants. 

The minimization process is accomplished by introducing a parameter a so that 
I -> I(q) and requiring d!/da = 0 ata = 0. The a parameter is introduced by 
letting 


y(2) — y(x, a) = y(x) + an(x) (G.5) 
where y(z) is the function that minimizes [(a) with a = 0. The idea is that adding 
any function 7(x) with a 4 0 causes [(a) to be larger than I(@ = 0). 
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The function y(z, a) and its derivative are then introduced into the integrand of 
Equation G.1. 


y(z) —> y(x,a) (G.6) 
dy Oy(z, @) 
—_ -> =O G.7 
dx Oz. a) 
The boundary conditions 
y@=1,a) = (G.8) 
y(xz=22,a) = yo (G.9) 
require 
n(r1) = n(x2) = 0. (G.10) 
A shorthand notation 
Oy(x,a 
2ulee) = Ya (x, a) (G.11) 


will be used to simplify the equations. 
The problem now becomes one of minimizing I(q@) with respect to a where 


(a) = f° de fle-y(z,a), ye(0,4)] (G.12) 
under the conditions 
y(z71,0) = (G.13) 
y(a2,a@) = yp (G.14) 
nai) = 0 (G.15) 
nx) = 0. (G.16) 


To accomplish this take d/da of the above expression and set it equal to zero at 
a=0. 


d ft 
o-{2 [ te flew(ea)aa(ma))) (G17) 
7 "Taf dy(e,a) , Of Iye(e,a) 
2 4 y(x,a Yo: (@, 
o= tf ax | a Pe bn }} (G.18) 
Now 
y(x,a) = y(x) + an(x) (G.19) 
so 
SE eer (G.20) 


0a 
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and 


Oo 
yr(t, 0) = 2 [y(a) + an(2)} (G.21) 

x 
_ dy(z) | dn 
Yr(@,a) = eee as (G.22) 

so that Buel F 

Ya (t, 7) . 
= 23 
Ba da me) 


Notice that these partial derivatives of y(x, a) and y,(x, @) with respect to @ are no 
longer a function of a. 

Substituting these values for the partial derivatives of y(z.a) and y,(z,a) into 
Equation G.18 gives, 


at al is OL 
O= if dr sate) Tk Bus a = (G.24) 


The second integral 


vy 


2 Of dn 
dzr—~ —. (G.25) 
a Oyy dx 
is attacked by an integration by parts with 
Pelee (G.26) 
dx 
and af 
v= a G.27 
u an ( ) 
so that ss 
MES OF ah Of | le d Of 
r——-— = —} — dx ——., G.28 
/ bas dr @ OYr a ; WO OYr ee) 


The first term on the right hand side of this equation vanishes because 77(.7) is zero at 
x, and ry. 
The minimization condition then becomes 


2 0 dO 
O= / dz (zx) E = ae | \ , (G.29) 
v1 % a=0 


Since this must be true for any 7(a2) when a = 0, the condition for minimizing /(@) 
becomes 


Of da of 
aay (eee 3 
; E dx a o _ 


In this expression 
f= flz.y(x.@), yx(x. @)] (G.31) 
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where 
y = y(x, @) = y(z) + an(z) (G.32) 


and the function y(z) is the function we seek, the function that minimizes the original 
integral. In the limit a = 0 the above condition for minimization of the original 
integral becomes 


Of d of 
0= Gy de ue (G.33) 
where 
y = y(2) (G.34) 
and 
pk dy(a) (G.35) 
dz 
Equation G.33 is solved for y(z) given the boundary conditions 
ya) = (G.36) 
y(@2) = ye. (G.37) 
This is known as Eulers Equation. It can be written in equivalent form as 
Of d Of 
= : G.38 
0 Dy de (4 Ys 5) ( ) 


This equivalence can be seen by expanding the total derivative with respect to z. 
Equation G.33 is useful if there is no explicit y-dependence in the integrand because 
it reduces to 


0= —>— (G.39) 


ee (1 Ya x). (G.40) 


ERRATA LIST 


p. 10 Change Equation 1.48 to read: 
(C| = |A| |B] sin 8, (1.48) 


p. 16 
(i) Exercise 2. first equation, change c, 7 to 2,7: 


Mij =ij? for i,j = 1,2,3, 


(ii) Exercise 4, first line, change "row matrix" to “column matrix". 


(ii) Exercise 4, first equation, change equation to read: 
(IDV yy! = (vy 
p. 49 Change Equation 3.14 to read: 
F = (F-6,)6, + (F- &g)€e + (F- yg )&y. (3.14) 
p. 64 Exercise 20, change equation to read: 
B = Boaz. 


p. 65 Exercise 20. change equation to read: 


Bop 
qe; 
Po ss 


B= 
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p. 75 Change Equation 4.45 to read: 
p. 133 Exercise 25, first line, change p.(z, y, y) to pe(&, y, z). 


p- 152 In all equations on the page, change c,, to c,,, i.e., 


f(x) = ve Cn(£ ~ Lo)” = Co +.c1(H — Zo) + Ca(H@ — 20)? ++: (6.96) 
n=0 
1d” 
=) (6.97) 


Cn (x — Lo)” 


li Oo 1. 6.98 
exes Cn—1(£ — Lo)! x ( ) 
+00 
F(t) = S> en(w- 20)” (6.99) 
n=—oco 
SO Ea ee a al (eed hea a ag a 2 


(c—2,)? © (@— 26) 
Pp. 162 Paragraph below Equation 6.135, 
(i) third line, change C) to C2 and C2 to C1. 
(ii) seventh line, change C2 to C}. 
p. 215 Exercise 49. (b), change y = 1/2 tou = 1/2. 


p. 216 Exercise 55, change the sentences beginning in the second line of the para- 
graph to the end of the paragraph to read: 
Integrate to obtain z = z(w). 


p. 241 Change Equation 7.90 to read: 


2N4+no—-1 
ik See" (7.90) 
N=No 


p- 242 Change the first line of Equation 7.92 to read: 


N-1 


= x cer 
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Pp. 300 Exercise 21: 


(i) Change the last sentence of the first paragraph to read: 
“Show that the conditions for closure with zero contribution are different for a 
real, definite integral than they are for a Fourier inversion.” 


(ii) Change the first equation to read: 


2 
F(w) = 


ws + 4 


(ili) Change the equation under (b) to read: 
f(t)= i da et F(w) 
p. 316 Figure 9.14, change the label on the abscissa from “imag” to “real”. 
p. 318 Figure 9.16, change the label on the abscissa from “imag” to “real”. 


p. 354 Change the part of the sentence below Equation 10.104 to read: 


“*... where the c,, are unknown coefficents, and s is a fixed number, not necessarily 
an integer or pure real. Often, however, it is a positive or negative integer.” 


p. 379 Figure 10.16, change —K x(t) to read -K, x(t). 

p. 383 Title of Figure 10.21, change “Principal” to “Principle”. 

p. 408 Exercise 22 (a), last line, change x = x, to y = Yo. 

p. 421 Exercise 53, first line below the equation, change a(x), b(a) to p(x), q(x). 
p. 422 Exercise 54 (d), first line, change d(t) to d(z). 


p. 428 Change Equation 11.20 to read: 


Amne tt vers ett [Cmy ett ene lmy 

+ Bret VEer= ett [Emy et r/ —Cne—Cmy 

+ Cpe eVene em [Cmy eb Arf —enx—Cmy 

oe. + Dire tVCnxe@ yvemy gh FV one emy 

mon Timn ettvenset¥y Cmye 7Y one omy 

LE —£VCns ptuVemy pz ~ ene —Cmy 
mne€ € € 

+GinnettVenz es /Emy e—*V/ —Cn2x—Cmy 

+Himne (Vr te¥ /Emy e~7*V TCnx—lmy 


(11.20) 
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p. 448 Split the last sentence of the paragraph Orthogonality Relations into two 


sentences: 
.. eigenfunction problem. 


A Hermitian operator can then be identified and the orthogonality properties of the 
eigenfunctions determined. 


p. 449 Change Equation 11.111 to read: 


dJ, Slt 


[J-(omp/ro)p 7 =i) (11.111) 


Q 


p. 453 Change Equation 11.136 to read: 
o0.= Sofa {ome {2G arse 
p. 462 Change Equation 11.176 to read: 
Ad(z) = +++ + ani fet + 29? 42 t4 4. } (11.176) 
p. 463 
(i) Line above Equation 11.180, change Bg(x) to Be(2). 
(ii) Line above Equation 11.181, change Ag(z) to Ao(2). 
p. 464 Third paragraph, change “‘six” to “five”. 
p. 473 Figure 11.21, last line of equation, change +Vo to -Vo. 


p. 474 Equation 11.237: replace closing brace by opening brace: 


&(r,0,¢) = >_ se ae 1{¥an( ¢). (11.237) 


£=0 m=—£ 


p. 482 Exercise 16(a), change Equation to read: 


(0) = [ag F690 19) 


p. 484 Exercise 24, change the second and third line below the first equation to 
read: 
“ _, the differential equation for radially dependent component becomes” 
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p. 489 Exercise 34: insert 0 on the right-hand side of Laplace’s equation in two 
dimensions: 


‘ 1 
V?U(q,@) =— 
(q15 42) haha 


0 hy O 0 hg O 
22 eo Be ly =0, 
Og hi Oq, = Oqa hy Fas (a1, 42) 


p. 507 Exercise 7, change equation to read: 
war) =x — [ dt (t — x)wct). 
0 


p. 508 Exercise 12.i., change equation to read: 
1 
w(z) = | dt (x — t)*4(t). 
-1 


p. 537 First line of the paragraph above Equation 13.45, change sentence to read: 
“We should also look carefully at the little circular part...” 


p. 542 
(i) Second line from top of page, change term to read dz = e'?™ dr, 


(ii) Change Equation 13.57 to read: 


¢ =2r= [+ [= (13.57) 
Co 1 3 


p. 547 Change Equation 13.72 to read: 


[4+ j—— = — = —=0 at the saddle points. (13.72) 
Zz 


p. 581 Change Equation 14.100 to read: 


= tha, (14.100) 
Jj 


p. 593 Exercise 3, change equation to read: 
M = M388? = MY 8:8). 


p. 611 In the first line of the paragraph above Table 15.16, change “are shown” to 
“is shown”. 


p. 614 Inthe fourth line from the top of the second paragraph, change “dimension” 
to “dimensions’’. 
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p. 619 Table 15.25, row ce, column g,, change “—1” to “—2”. 


p. 623 Change the sentence beginning in the 2nd line of the 2nd paragraph to read: 


[2] 


representation, no D,” representation, and 


“The net result is that there is one D} 


one pe representation in the block diagonalization.” 


p. 625 Equation 15.62, change element (1,3) and (3,1) from “1” to “—1”. 
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INDEX 


Abelian group, 598, 618 

Absolutely integrable function, 254 

Aliasing, 240-241 

Analytic continuation, 175 

Analytic function, see Complex functions, analytic 
Associated Legendre equation, 460 

Associated Legendre polynomials, 472 
Autocorrelation, 266 

Axial vector, 89 


Basis functions, 234 
Basis vectors 
contravariant, 576-579 
covariant, 573, 576-579 
curvilinear coordinates, 49-51 
nonorthogonal, 564 
orthonormal, 2, 44, 75 
position dependent, 655 
position independent, 2, 564 
Bessel equation, 444 
spherical, 354 
Bessel functions 
first kind, 444-446 
modified, 453 
orthogonality, 448-450 
second kind, 444-446 
spherical, 357 
Bilateral Laplace transform, see Laplace 
transform, double-sided 
Block diagonal matrices, 613 
Boundary conditions, 342 
Dirichlet, 429 
for Green’s functions, 384, 387-391 


homogeneous, 384 

Neumann, 429 

nonhomogeneous, 387-391 
Branch cuts, 519-522 
Branch points, 516-519 
Bromwich contour, 313, 333 


Cartesian coordinates, 2 
Cauchy integral formula, 147-150 
Cauchy integral theorem, 144-147, 510-513 
Cauchy-Riemann conditions, 141-144, 546 
Causality, 370 
Character, 610-612 
Character table, 621-622 
Child-Langmuir problem, 359-366, 410-412 
Christoffel symbols, 655-659 
Class, 610-612, 614 
Cleverly closed contour, 540 
Closure, 175-189, 284-285, 310-312 
Cofactor, 572 
Completeness 
Fourier series, 234 
group representations, 622-623 
Sturm-Liouville eigenfunctions, 437 
Completing the square, 271 
Complex functions, 138-202, 509-542 
analytic, 140-150 
derivatives of, 140-144 
hyperbolic, 140 
logarithm, 140 
multivalued functions, 509-542 
trigonometric, 139-140 
visualization of, 138 
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Complex plane, 135-136 
Complex variables, 135-137 
conjugates, 136 
magnitude, 136 
polar representation, 136-137 
real and imaginary parts, 135, 136 
Conductivity tensor, 67-70, 76 
Conformal mapping, see Mapping, conformal 
Conservative field, 38 
Continuity equation, 27~29, 361 
Contour deformation, 146-147, 532-539, 542 
Contravariant components, 568-582 
Convergence 
absolute, 150, 151 
complex series, 150-151 
Pourier series, 231-234 
mean-squared, 232-234 
pointwise, 232 
ratio test, 151 
uniform, 155, 232 
Convolution, 261-265, 325 
Fourier transform of, 261-280 
Laplace transform of, 320~—323 
Correlation, 265-266 
Cosets, 608-610 
Covariant components, 568-570, 573-582 
Covariant derivatives, 655-659 
Cross-correlation, 265 
Cross product, 8, 10-12, 53, 86-92 
Curl 
Cartesian coordinates, 24 
curvilinear coordinates, 55—58 
cylindrical coordinates, 58 
integral definition, 34 
nonorthonormal coordinates, 659 
physical picture of, 29-32 
spherical coordinates, 58 
Current density, 28, 68 
Curvilinear coordinates, 49-58 
Cylindrical coordinates, 45-47 


Decomposition of block matrices, 622 
Del operator 
Cartesian coordinates, 23-24 
curvilinear coordinates, 54-58 
identities, 32-34, 41 
Delta function, Dirac, see Dirac delta function 
Delta, Kronecker, see Kronecker delta 
DeMoivre’s formula, 203 
Density functions, singular, 114-121 
Differential equations, 339-403 
boundary conditions, 342 
constant coefficients, 347-349 
coupled, 341 


INDEX 


exact differential, 343-345 
first-order, 342-346 
Fourier transform solutions, 366—37 1 
Green’s function solutions, 376-403 
homogeneous, 341-342 
integrating factor, 345-346 
Laplace transform solutions, 371~376 
linear, 340-342 
nonhomogeneous, 341-342, 351-354 
nonlinear, 340, 358-359 
order, 341-342 
ordinary, 340 
partial, 340, 391 
second-order, 347-354 
separation of variables, 342-343, 424475 
series solutions, 354-358 
terminology, 339~342 
Diffusion equation, 391-398 
Dipole, 123-125 
Dipole moment, 122 
Dirac delta function, 100-126 
complicated arguments, 108-111 
derivatives of, 112-114 
Fourier transform of, 269-270 
integral definition, 106-108 
integral of, 111-112 
Laplace transform of, 314-316 
sequence definition, 104-105 
shifted arguments, 102 
three-dimensional, 115 
use in Green’s function, 378, 382 
use in orthogonality relations, 253-254 
Direct sum of block matrices, 613 
Dirichlet boundary conditions, 429 
Discrete Fourier series, 234-242 
Displacement vector, 52 
Divergence 
Cartesian coordinates, 24 
curvilinear coordinates, 54—55 
cylindrical coordinates, 58 
integral definition, 34 
nonorthonormal coordinates, 656-659 
physical picture of, 27-29 
spherical coordinates, 58 
Dot product, 7-10, 53 
nonorthonommnal coordinates, 568—570, 573 
tensors, 70 
Double-sided Laplace transform, see Laplace 
transform, double-sided 
Doublet, 112-114 
Dyadic product, 71 


Eigenfunction, 433-440 
Eigenvalue, 79, 435-436 
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Eigenvector, 79, 435 
Einstein summation convention, 2 
Electric dipole, 123-125 
Electric monopole, 122-123 
Electric quadrapole, 133 
Elliptical coordinates, 62 
Essential singularity, 170 
Euler angles, 632 

Euler constant, 445 

Euler’s equation, 136-137 
Exact differential, 343-345 


Factorial function, 553 
Fast Fourier transform (FFT), 234 
Field lines, 20 
Fourier series, 219-242 
circuit analogy, 223~224 
convergence, 231-234 
discrete form, 234-242 
exponential form, 227-231 
orthogonality conditions, 221-223, 228-229 
sine-cosine form, 219-223 
of a square wave, 225-227 
of a triangular wave, 224-225 
Fourier transform, 250-295 
circuit analogy, 256~257 
of a convolution, 261 
of a cross-correlation, 266 
of a damped sinusoid, 287-290 
of a decaying exponential, 283-287 
of a delayed function, 258 
of a delta function, 269-270 
of a derivative, 259-260 
differential equation solutions, 366-371 
of even and odd functions, 259 
existence, 254-256 
of a Gaussian, 270-273 
integral equation solutions, 499 
limits of, 303-313 
orthogonality condition, 253-254 
of a periodic function, 273-275, 279-280 
of a product, 260-261 
of pure real functions, 259 
relation to Fourier series, 250-253, 275 
of a square pulse, 267~269 
transform pair, 252 
Fredholm equation 
first kind, 492-499 
second kind, 492, 504 
Frobenius, method of, 354-358, 444, 461, 
469 


Gamma function, 553-554 
Gauss’s theorem, 35-36 


Gaussian 
Fourier transform of, 270-273 
sequence function, 105 
General relativity, 565-566 
Generalized coordinates, see Curvilinear 
coordinates 
Generalized functions, 100, 103 
Geodesics, 566 
Gibbs phenomenon, 233 
Gradient 
Cartesian coordinates, 24 
curvilinear coordinates, 54 
cylindrical coordinates, 58 
integral definition, 34 
nonorthonormal coordinates, 582, 656 
physical picture of, 24-27 
spherical coordinates, 58 
Gram-Schmidt orthogonalization, 437 
Green’s functions, 376-403, 647-651 
boundary conditions, 384, 387-391 
for diffusion equation, 391-398 
for driven wave equation, 398-403 
multiple independent variables, 391-403 
for a stretched string, 385-391 
symmetry properties, 398, 422-423 
translational invariance, 378, 398 
Green’s theorem, 36-37 
Groups, 597-634 
Abelian, 598, 618 
C4, 598-601 
character, 610-612 
character table, 621-622 
class, 610-612, 614 
continuous, 630-634 
cosets, 608-610 
Dy, 601-603 
D3, 604-605 
definition, 597-598 
finite, 598-607 
multiplication table, 598-599 
order, 598 
permutation, 607 
rearrangement lemma, 599 
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representations, see Representations, group 


rotation, 630-633 
SU(2), 633-634 
subgroups, 607-608 


Harmonic oscillator 

damped, driven, 368-37 | 

simple, 347 

undamped, driven, 378-381 
Heaviside step function, 111-112, 316-317 
Heisenberg uncertainty principle, 273 
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Hemholtz’s theorem, 38-40 
Hermitian 
matrix, 80, 435 
operator, 436-440 
Homogeneous boundary conditions, see 
Boundary conditions, homogeneous 
Homogeneous differential equation, see 
Differential equations, homogeneous 
Hyperbolic coordinates, 62 


Impulse, ideal, 101-102 
Indicial equations, 355 
Inner product, see Dot product 
Integral equations, 491-506 
classification of, 492493 
Fourier transform solutions, 499 
Laplace transform solutions, 499-500 
relation to differential equations, 
493-498 
separable kemel solutions, 504-506 
series solutions, 501-503 
Integrals 
analytic functions, 144-150 
closed, 21, 175-189, 510-513 
line, 7, 21, 53 
multivalued functions, 510-513, 534-542 
operator form, 20~21 
principal part, 184-188 
surface, 22-23, 53 
volume, 23, 54 
Integrating factor, 345-346 
Irreducible matrix representation, see 
Representations, group 
Irrotational vector field, 40 


Jordan’s inequality, 285 


Kernel, 492 

causal, 498 

separable, 504—506 

translationally invariant, 498 
Kronecker delta, 8-10, 563-564, 574, 581 


L’Hopital’s ruie, 170 

Laguerre polynomials, 488 

Laplace contour, 313, 333 

Laplace equation, 424-475 
Cartesian coordinates, 424-433 
conformal map solutions, 192-196 
cylindrical coordinates, 441, 457 
spherical coordinates, 458-475 

Laplace transform, 313-335 
circuit analogy, 326-331 
of a convolution, 320-323 


of a delayed function, 319 
of a delta function, 314—316 
of a derivative, 319-320 
differential equation solutions, 371-376 
double-sided, 33 1-335 
of a growing sinusoid, 317-318 
integral equation solutions, 499-500 
inversion contour, 313, 333 
orthogonality condition, 316 
of a product, 323-326 
relation to Green's function, 377-381 
of a step function, 316-317 
transform pair, 313 
Laplacian, 33, 34 
Laurent series 
complex functions, 159-171 
Teal functions, 152 
Left-hand rule, 10, 88 
Left-handed coordinate system, 10, 86-92 
Legendre polynomials, 464-466 
Levi-Civita symbol, 11-92, 639-641 
Line distributions, singular, 119-121 
Linearly dependent functions, 350, 354 
Lorentz transformation, 583-585, 592 
Low-pass filter, 292 


Mapping, 138 

conformal, 189-202 

Riemann sheets, 532-534 

Schwartz-Christoffel, 196-202 
Matrices, 3-5 

array notation, 3 

Hennitian, 80, 435 

multiplication, 3-4 

notation, 3 

trace, 16, 610 

transpose, 5 

unimodular, 633 

unitary, 633-634 
Maxwell equations, 367—368 
Metric tensor, 569-570, 575, 579, 581, 

588-592 

Minkowski space, 591 
Mirror test, for pseudovectors, 653-654 
Moment of inertia tensor, 78 
Moments of a distribution, 121~125 
Monopole, 122-123 
Monopole moment, 122 


INDEX 


Multiplication table, see Groups, multiplication 


table 
Multipole expansion, 121-125 


Neighborhood, 142 
Neumann boundary conditions, 429 
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Neumann functions Ratio test, 151 

Bessel equation solution, 444-446 Rearrangement lemma, 599 

spherical, 357 Reducible matrix representation, see 
Neumann series, 501-503 Representations, group 
Nonhomogeneous boundary conditions, see Relativity 

Boundary conditions, nonhomogeneous general, 565-566 
Nonhomogeneous differential equation, see special, 564-565, 583-592 
Differential equations, nonhomogeneous Removable singularity, 169 
Nonisotropic materials, 68 Representations, group, 600 
Normal modes of vibration, 624-630 completeness, 622-623 
Notation, overview, 1-5 decomposition of, 622 
Nyquist sampling rate, 294 equivalent vs inequivalent, 614 
irreducible, 613-623 

Ohin’s law, 67~70 matrix notation, 612-614 
Operators operator, 606-607 

group representations, 606-607 orthogonality, 617-621 

Hermitian, 436-440 reducible, 613-617 

linear differential, 341 Residue theorem, 171-175, 510-513 

Sturm-Liouville, 438-439 Riemann integration, 125-126, 256, 327 
Order Riemann sheets, 513—516, 532-534 

of a differential equation, 341-342 Right-hand rule, 10, 22, 87 

of a group, 598 Right-handed coordinate system, 10, 46, 48, 53, 
Orthogonality relations 82, 86-92 

associated Legendre polynomials, 472 Rodrigues’s formula 

Bessel functions, 448-450 associated Legendre polynomials, 472 

exponential Fourier series, 228-229 Legendre polynomials, 464 

Fourier series, 221-223 Rotation matrix, 6, 12-13 

Fourier transform, 253-254 

Laplace transform, 316 Saddle point, 545-547, 554-555 

Legendre polynomials, 465-466 Sampling theorem, 290-295 
Outer product, see dyadic product Scalar fields, 18—19 

Scalar potential, 38, 40 

Parity conservation, 90 Scale factors, 49-51 
Period, 220 Schwartz-Christoffel mapping, see Mapping, 
Permutation group, 607 Schwartz-Christoffel 
Permutations, even and odd, 11 Separation of variables 
Phasors, 148 first-order differential equations, 342-343 
Piecewise smooth function, 232, 233 Laplace equation in Cartesian coordinates, 
Point mass, 102-103, 114-116 424-433 
Polar vector, 89 Laplace equation in cylindrical coordinates, 
Pole, 169 441-457 
Position vector, 44-45 Laplace equation in spherical coordinates, 

cylindrical coordinates, 46 458-475 

spherical coordinates, 49 Sequence functions, 104-105, 107-108 
Positive definite function, 436 Sheet distributions, singular, 116—119 
Principal part, 184-188 Shur’s lemma, 623-630 
Pseudo-objects, 86-92 Sifting integral, 106, 112 

pseudoscalar, 90-91 Similarity tansform, 610 

pseudotensor, 91-92 Sinc function, 107-108, 268 

pseudovector, 11, 86-90, 653-654 Singularity circle, 156, 163-~167 

Skewed coordinates, 565-567, 585-588 

Quadrapole, 133 Solenoidal vector field, 40 
Quadrapole moment, {22 Special relativity, 564—565, 583-592 


Quadrature, method of, 358-366 Speed of light, invariance, 564, 585 
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Spherical coordinates, 48-49 

Spherical harmonics, 474-475 

Steepest descent, method of, 542-555 

Step function, see Heaviside step function 
Stirling’s approximation, 554 

Stokes’s theorem, 37-38 

String problem, 385-391 

Sturm-Liouville form, 438-439 

Subgroups, 607-608 

Subscript notation, 2 

Subscript/summation notation, 3, 12~15, 32-34 
Subscript/superscript notation, 573-575, 581 
Summation convention, Einstein, 2 
Superposition principle, 341, 382 

Symmetry operations, 599 


Taylor series 
complex functions, 139-140, 153-159 
real functions, 152 

Tensors, 67-86, 562-592 
contravariant vs. covariant, 579-581 
coordinate transformations, 76~78, 84-86, 

562-564, 580 

diagonalization, 78-84 
metric, 569-570, 375-579, 581, 588-592 
in non-orthogonal coordinates, 562-592 
notation, 69-71 
rank, 71 


INDEX 


Tokamak, 61 

Toroidal coordinates, 61, 62 

Tour, 517 

Trace, see Matrices, trace 

Transfer function, 293 

Transformation matrix, 73-75, 85-86, 570-574 
Translational invariance, 378, 398, 498 


Transpose, 5 


Unimodular, 633 
Unitary matrix, 633-634 


Vector fields, 18-20 

Vector potential, 40 
Vector/tensor notation, 70, 71 
Vectors 


coordinate transformations, 71-76, 90, 570-573 


identities, 14-15, 17 

notation, 3 

rotation of, 5-7, 12-13 
Vierergruppe, see Groups, D2 
Volterra equation 

first kind, 492-499 

second kind, 492-493 


Wave equation, 398-403 
Weighting function, 436 
Wronskian, 349-354 


This Page Intentionally Left Blank 


This Page Intentionally Left Blank 


This Page Intentionally Left Blank 


This Page Intentionally Left Blank 


This Page Intentionally Left Blank 


This Page Intentionally Left Blank 


